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CTSZ DISRUPTIONS, COMPOSITIONS AND METHODS RELATING THERETO 

Field of the Invention 

The present invention relates compositions, including transgenic animals and methods relating 
to the characterization of gene function. 

5 Background of the Invention 

Proteases are proteolytic enzymes that catalyze the cleavage of peptide bonds in other proteins. 
The effect of such cleavage on protein molecules is diverse, hi some instances, proteolytic cleavage 
causes the cleaved protein to become inactive, hi other instances, proteolytic cleavage causes a once 
inactive protein to become activate. In yet other instances, proteolytic cleavage is a mechanism 
10 whereby a single polypeptide precursor is cleaved into two or more individual polypeptides. 

Proteases serve to degrade invading organisms, antigen-antibody complexes and certain tissue 
proteins that are no longer necessary or useful to the organism, hi a normally functioning organism, 
proteases are produced in a limited quantity and are regulated in part through the synthesis of protease 
inhibitors. A large number of naturally occurring protease inhibitors serve to control the endogenous 
15 proteases by limiting their reactions locally and temporally. In addition, the protease inhibitors may 
inhibit proteases introduced into the body by infective agents. Tissues that are particularly prone to 
proteolytic attack and infection, e.g. those of the respiratory tract, are rich in protease inhibitors. 

Proteases, such as, for example, the serine and cysteine proteases (also referred to as 
"proteinase"), have been linked to apoptosis (programmed cell death), and are believed to be involved 
20 in the development and progression of numerous diseases and disorders, including Alzheimer's and 
Parkinson's diseases, AIDS and cancer. 

One cysteine proteinase, cathepsin Z (CTSZ), is a novel cysteine proteinase with a short 
propeptide domain and a unique chromosomal location. CaUiepsin Z exhibits structural features 
characteristic of cysteine proteinases. Expression studies suggested that this enzyme could be involved 
25 in the normal intracellular protein degradation that occurs in all cell types. Cathepsin Z is also 

ubiquitously distributed in cancer cell lines and in primary tumors from different sources, suggesting 
that this enzyme may participate in tumor progression as reported for other cathepsins. (See, e.g., 
Santamaria et al., J. Biol. Chem. 273 (27): 16816 (1998)). 

As the cathepsin Z gene, may be important in biological and disease processes, a clear need 
30 exists for further in vivo characterization, which may aid in the identification and discovery of 

therapeutics and treatments useful in preventing, ameliorating or correcting dysfunctions or diseases. 

Summary of the Invention 

The present invention generally relates to transgenic animals, as well as to compositions and 
methods relating to the characterization of gene function. 
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The present invention provides transgenic cells comprising a disruption in endogenous CTSZ. 
The transgenic cells of the present invention are comprised of any cells capable of undergoing 
homologous recombination. Preferably, the cells of the present invention are stem cells and more 
preferably, embryonic stem (ES) cells, and most preferably, murine ES cells. According to one 
5 embodiment, the transgenic cells are produced by introducing a targeting construct into a stem cell to 
produce a homologous recombinant, resulting in a disruption of CTSZ. In another embodiment, the 
transgenic cells are derived from the transgenic animals described below. The cells derived from the 
transgenic animals includes cells that are isolated or present in a tissue or organ, and any cell lines or 
any progeny thereof. 

10 The present invention also provides a targeting construct and methods of producing the 

targeting construct that when introduced into stem cells produces a homologous recombinant. In one 
embodiment, the targeting construct of the present invention comprises first and second polynucleotide 
sequences that are homologous to a CTSZ gene. The targeting construct may also comprise a 
polynucleotide sequence that encodes a selectable marker that is preferably positioned between the two 

15 different homologous polynucleotide sequences in the construct. The targeting construct may also 
comprise other regulatory elements that can enhance homologous recombination. 

The present invention further provides non-human transgenic animals and methods of 
producing such non-human transgenic animals comprising a disruption in CTSZ. The transgenic 
animals of the present invention include transgenic animals that are heterozygous and homozygous for 

20 a disruption in the CTSZ gene. In one aspect, the transgenic animals of the present invention are 

defective in the function of CTSZ. In a preferred aspect, the present invention provides a transgenic 
mouse comprising a disruption in CTSZ gene, wherein there is no native expression of the endogenous 
CTSZ gene. 

The present invention also provides transgenic animals comprising a phenotype associated with 
25 having a disruption in CTSZ. Preferably, the transgenic animals are rodents and, most preferably, are 
mice. 

In one aspect of the present invention, a transgenic mouse having a disruption in the CTSZ 
gene comprises a phenotype consistent with one or more symptoms of a disease associated with CTSZ. 
In another aspect of the present invention, transgenic mice having a homozygous disruption in 

30 the CTSZ gene exhibit decreased disease severity to inflammatory bowel disease. Specifically, the 

homozygous mice exhibit increased survival and decreased body weight loss as compared to wild-type 
mice. In one embodiment, the increased survival and decreased body weight loss occurs in 
homozygous mice when exposed to DSS, a chemical inducer of inflammatory bowel disease. The 
increased survival and decreased body weight loss are consistent with symptoms of human 

35 inflammatory bowel disease (IBD), and more particularly, with Crohn's disease, the most severe type of 
IBD. These findings demonstrate the potential role of CTSZ in inflammatory bowel disease. 
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Therefore, the transgenic mice provide a valuable model for inflammatory bowel disease, which may be 
useful for evaluating and discovering treatments for inflammatory bowel disease. 

In further aspect of the present invention, transgenic mice having a homozygous disruption in 
the CTSZ gene exhibit a decreased disease progression of rheumatoid arthritis. More particularly, as 
5 compared to wild-type mice, homozygous mice exhibit no disease progression or a resistance to the 
development of rheumatoid arthritis. These findings demonstrate the potential role of CTSZ in 
rheumatoid arthritis. As such, the transgenic mice provide a valuable model for rheumatoid arthritis, 
which may be useful for evaluating and discovering treatments for rheumatoid arthritis. 

The present invention further provides methods of identifying an agent that modulates a 

10 phenotype associated with CTSZ function or activity. Methods of identifying such agents comprise 
contacting a test agent with CTSZ and determining whether the agent modulates CTSZ. 

The present invention also provides methods of identifying agents capable of affecting a 
phenotype of a transgenic animal. For example, a putative agent is administered to the transgenic 
animal and a response of the transgenic animal to the putative agent is measured and compared to the 

15 response of a "normal" or wild-type mouse, or alternatively compared to a transgenic animal control 
(without agent administration). The invention further provides agents identified according to such 
methods. The present invention also provides methods of identifying agents useful as therapeutic 
agents for treating conditions associated with a disruption or other mutation (including naturally 
occurring mutations) of the CTSZ gene. 

20 One aspect of the present invention relates to a method of identifying a potential therapeutic 

agent for the treatment of a disease associated with the CTSZ gene, in which the method includes the 
steps of: administering the potential therapeutic agent to a transgenic mouse having a disruption in a 
CTSZ gene; and determining whether the potential therapeutic agent modulates the disease associated 
with the CTSZ gene, wherein the modulation of the disease identifies a potential therapeutic agent for 

25 the treatment of that disease. 

A further aspect of the present invention provides a method of identifying a potential 
therapeutic agent for the treatment of a disease associated with the CTSZ gene, in which the method 
includes the steps of: contacting the potential therapeutic agent with CTSZ gene product; and 
determining whether the potential therapeutic agent modulates that product, wherein modulation of the 

30 gene product identifies a potential therapeutic agent for the treatment of the disease associated with the 
CTSZ gene. 

The present invention further provides a method of identifying agents that modulate the effect 
or activity of CTSZ. The method includes administering an effective amount of the agent to a 
transgenic animal, preferably a mouse. The method includes measuring a response of the transgenic 
35 animal, for example, to the agent, and comparing the response of the transgenic animal to a control 
animal, which may be, for example, a wild-type animal or alternatively, a transgenic animal control. 
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Compounds that modulate the effect or activity of CTSZ may also be screened against cells in cell- 
based assays, for example, to identify such compounds. 

The invention also provides cell lines comprising nucleic acid sequences of a CTSZ gene. 
Such cell lines may be capable of expressing such sequences by virtue of operable linkage to a 
5 promoter functional in the cell line. Preferably, expression of the CTSZ gene sequence is under the 

control of an inducible promoter. Also provided are methods of identifying agents that interact with the 
CTSZ gene, comprising the steps of contacting the CTSZ gene with an agent and detecting an 
agent/CTSZ gene complex. Such complexes can be detected by, for example, measuring expression of 
an operably linked detectable marker. 

10 The invention further provides methods of treating diseases or conditions associated with a 

disruption in a CTSZ gene, and more particularly, to a disruption or other alteration in the expression or 
function of the CTSZ gene. In a preferred embodiment, methods of the present invention involve 
treating diseases or conditions associated with a disruption or other alteration in the CTSZ gene's 
expression or function, including administering to a subject in need, a therapeutic agent that affects 

15 CTSZ expression or function. In accordance with this embodiment, the method comprises 

administration of a therapeutically effective amount of a natural, synthetic, semi-synthetic, or 
recombinant CTSZ gene, CTSZ gene products or fragments thereof as well as natural, synthetic, semi- 
synthetic or recombinant analogs. 

In one aspect of the present invention, a therapeutic agent for treating a disease associated with 

20 the CTSZ gene modulates the CTSZ gene product. Another aspect of the present invention relates to a 
therapeutic agent for treating a disease associated with the CTSZ gene, in which the agent is an agonist 
or antagonist of the CTSZ gene product. 

The present invention also provides compositions comprising or derived from ligands or other 
molecules or compounds that bind to or interact with CTSZ, including agonists or antagonists of CTSZ. 

25 Such agonists or antagonists of CTSZ include antibodies and antibody mimetics, as well as other 
molecules that can readily be identified by routine assays and experiments well known in the art. 

The present invention further provides methods of treating diseases or conditions associated 
with disrupted targeted gene expression or function, wherein the methods comprise detecting and 
replacing through gene therapy mutated or otherwise defective or abnormal CTSZ genes. 

30 Definitions 

The following terms have the meanings ascribed to them below unless specified otherwise. 

The term "gene" refers to (a) a gene containing at least one of the DNA sequences disclosed 
herein; (b) any DNA sequence that encodes the amino acid sequence encoded by the DNA sequences 
disclosed herein and/or; (c) any DNA sequence that hybridizes to the complement of the coding 
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sequences disclosed herein. Preferably, the term includes coding as well as noncoding regions, and 
preferably includes all sequences necessary for normal gene expression. 

The terms "polynucleotide" and "nucleic acid molecule" are used interchangeably to refer to 
polymeric forms of nucleotides of any length. The polynucleotides may contain deoxyribonucleotides, 
5 ribonucleotides and/or their analogs. Nucleotides may have any three-dimensional structure, and may 
perform any function, known or unknown. The term "polynucleotide" includes single-, double-stranded 
and triple helical molecules. 

"Oligonucleotide" refers to polynucleotides of between 5 and about 100 nucleotides of single- or 
double-stranded DNA. Oligonucleotides are also known as oligomers or oligos and may be isolated 

10 from genes, or chemically synthesized by methods known in the art. A "primer" refers to an 

oligonucleotide, usually single-stranded, that provides a 3'-hydroxyl end for the initiation of enzyme- 
mediated nucleic acid synthesis. The following are non-limiting embodiments of polynucleotides: a 
gene or gene fragment, exons, introns, mRNA, tRNA, rRNA, ribozymes, cDNA, recombinant 
polynucleotides, branched polynucleotides, plasmids, vectors, isolated DNA of any sequence, isolated 

15 RNA of any sequence, nucleic acid probes and primers. A nucleic acid molecule may also comprise 
modified nucleic acid molecules, such as methylated nucleic acid molecules and nucleic acid molecule 
analogs. Analogs of purines and pyrimidines are known in the art, and include, but are not limited to, 
aziridinycytosine, 4-acetylcytosine, 5-fluorouracil, 5-bromouracil, 5-carboxymethylaminomethyl-2- 
thiouracil, 5-carboxymethyl-aminomethyluracil, inosine, N6-isopentenyladenine, 1-methyladenine, 1- 

20 methylpseudouracil, 1-methylguanine, 1-methylinosine, 2,2-dimethylguanine, 2-methyladenine, 2- 
methylguanine, 3-methylcytosine, 5-methylcytosine, pseudouracil, 5-pentylnyluracil and 2,6- 
diaminopurine. The use of uracil as a substitute for thymine in a deoxyribonucleic acid is also 
considered an analogous form of pyrimidine. 

A "fragment" of a polynucleotide is a polynucleotide comprised of at least 9 contiguous 

25 nucleotides, preferably at least 15 contiguous nucleotides and more preferably at least 45 nucleotides, 
of coding or non-coding sequences. 

The term "gene targeting" refers to a type of homologous recombination that occurs when a 
fragment of genomic DNA is introduced into a mammalian cell and that fragment locates and 
recombines with endogenous homologous sequences. 

30 The term "homologous recombination" refers to the exchange of DNA fragments between two 

DNA molecules or chromatids at the site of homologous nucleotide sequences. 

The term "homologous" as used herein denotes a characteristic of a DNA sequence having at 
least about 70 percent sequence identity as compared to a reference sequence, typically at least about 85 
percent sequence identity, preferably at least about 95 percent sequence identity, and more preferably 

35 about 98 percent sequence identity, and most preferably about 100 percent sequence identity as 

compared to a reference sequence. Homology can be determined using, for example, a "BLASTN" 
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algorithm. It is understood that homologous sequences can accommodate insertions, deletions and 
substitutions in the nucleotide sequence. Thus, linear sequences of nucleotides can be essentially 
identical even if some of the nucleotide residues do not precisely correspond or align. The reference 
sequence may be a subset of a larger sequence, such as a portion of a gene or flanking sequence, or a 
5 repetitive portion of a chromosome. 

The term "target gene" (alternatively referred to as "target gene sequence" or "target DNA 
sequence" or "target sequence" or "targeting DNA") refers to any nucleic acid molecule, 
polynucleotide, or gene to be modified by homologous recombination. The target sequence includes an 
intact gene, an exon or intron, a regulatory sequence or any region between genes. The target gene may 

10 comprise a portion of a particular gene or genetic locus in the individual's genomic DNA. 

As used herein, a "variant" of a CTSZ is defined as an amino acid sequence that is different by 
one or more amino acid substitutions. The variant may have "conservative" changes, wherein a 
substituted amino acid has similar structural or chemical properties, e.g., replacement of a leucine with 
isoleucine. More rarely, a variant may have "nonconservative" changes, e.g., replacement of a glycine 

15 with a tryptophan. Similar minor variations may also include amino acid deletions or insertions, or 

both. Guidance in determining which and how many amino acid residues may be substituted, inserted 
or deleted without abolishing biological or immunological activity may be found using computer 
programs well known in the art, for example, DNAStar software. 

The term "active fragment" refers to a fragment of a CTSZ that is biologically or 

20 immunologically active. The term "biologically active" refers to a CTSZ having structural, regulatory 
or biochemical functions of the naturally occurring CTSZ. Likewise, "immunologically active" defines 
the capability of the natural, recombinant or synthetic CTSZ, or any oligopeptide thereof, to induce a 
specific immune response in appropriate animals or cells and to bind with specific antibodies. 

The term "derivative", as used herein, refers to the chemical modification of a nucleic acid 

25 sequence encoding a CTSZ or the encoded CTSZ. An example of such modifications would be 

replacement of hydrogen by an alkyl, acyl, or amino group. A nucleic acid derivative would encode a 
polypeptide which retains essential biological characteristics of a natural CTSZ. 

"Disruption" of a CTSZ gene occurs when a fragment of genomic DNA locates and 
recombines with an endogenous homologous sequence. These sequence disruptions or modifications 

30 may include insertions, missense, frameshift, deletion, or substitutions, or replacements of DNA 

sequence, or any combination thereof. Lisertions include the insertion of entire genes, which may be of 
animal, plant, fungal, insect, prokaryotic, or viral origin. Disruption, for example, can alter the normal 
gene product by inhibiting its production partially or completely or by enhancing the normal gene 
product's activity. In a preferred embodiment, the disruption is a null disruption, wherein there is no 

35 significant expression of the CTSZ gene. 
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The term "native expression" refers to the expression of the full-length polypeptide encoded by 
the CTSZ gene, at expression levels present in the wild-type mouse. Thus, a disruption in which there 
is "no native expression" of the endogenous CTSZ gene refers to a partial or complete reduction of the 
expression of at least a portion of a polypeptide encoded by an endogenous CTSZ gene of a single cell, 
5 selected cells, or all of the cells of a mammal. The term "knockout" is a synonym for functional 
inactivation of the gene. 

The term "construct" or "targeting construct" refers to an artificially assembled DNA segment 
to be transferred into a target tissue, cell line or animal. Typically, the targeting construct will include a 
gene or a nucleic acid sequence of particular interest, a marker gene and appropriate control sequences. 
10 As provided herein, the targeting construct of the present invention comprises a CTSZ targeting 

construct. A "CTSZ targeting construct" includes a DNA sequence homologous to at least one portion 
of an endogenous CTSZ gene and is capable of producing a disruption in a CTSZ gene in a host cell. 

The term "transgenic cell" refers to a cell containing within its genome a CTSZ gene that has 
been disrupted, modified, altered, or replaced completely or partially by the method of gene targeting. 
15 The term "transgenic animal" refers to an animal that contains within its genome a specific 

gene that has been disrupted or otherwise modified or mutated by the method of gene targeting. 
'Transgenic animal" includes both the heterozygous animal (Le,, one defective allele and one wild-type 
allele) and the homozygous animal two defective alleles). 

As used herein, the terms "selectable marker" and "positive selection marker" refer to a gene 
20 encoding a product that enables only the cells that carry the gene to survive and/or grow under certain 
conditions. For example, plant and animal cells that express the introduced neomycin resistance {NeoT) 
gene are resistant to the compound G418. Cells that do not carry the Neo" gene marker are killed by 
G418. Other positive selection markers are known to, or are within the purview of, those of ordinary 
skill in the art. 

25 A "host cell" includes an individual cell or cell culture that can be or has been a recipient for 

vector(s) or for incorporation of nucleic acid molecules and/or proteins. Host cells include progeny of a 
single host cell, and the progeny may not necessarily be completely identical (in morphology or in total 
DNA complement) to the original parent due to natural, accidental, or deliberate mutation. A host cell 
includes cells transfected with the constructs of the present invention. 

30 The term "modulates" or "modulation" as used herein refers to the decrease, inhibition, 

reduction, amelioration, increase or enhancement of a CTSZ function, expression, activity, or 
alternatively a phenotype associated with CTSZ. 

The term "ameliorates" or "amelioration" as used herein refers to a decrease, reduction or 
elimination of a condition, disease, disorder, or phenotype, including an abnormality or symptom. 

35 The term "abnormality" refers to any disease, disorder, condition, or phenotype in which CTSZ 

is implicated, including pathological conditions and behavioral observations. 
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Brief Description of the Drawings 

Figure 1 shows a polynucleotide sequence for a CTSZ (SEQ ID NO:l). 
Figure 2 shows an amino acid sequence for a CTSZ (SEQ ID NO:2). 

Figure 3 shows the location and extent of the disrupted portion of the CTSZ gene, as well as the 
5 nucleotide sequences flanking the Neo^ insert in the targeting construct. 

Figure 4 shows the sequences identified as SEQ ID NO: 3 and SEQ ID NO:4, which were used 
as the 5'- and 3'- targeting arms (including the homologous sequences) in the CTSZ targeting construct, 
respectively. 

Figure 5 illustrates the disease progression observed in homozygous mice (-/-) relative to wild- 
10 type mice (+/+) in an antibody-induced model of rheumatoid arthritis. 

Figure 6 shows a graph comparing the weight loss observed in homozygous mice (-/-) relative 
to wild-type mice (+/+) in a chemically induced model of inflammatory bowel disease (Crohn's 
disease). 

Figure 7 shows a graph comparing the survival rate observed in homozygous mice (-/-) relative 
15 to wild-type mice (+/+) in the chemically induced model of inflammatory bowel disease. 



Detailed Description of the Invention 

The invention is based, in part, on the evaluation of the expression and role of genes and gene 
expression products, primarily those associated with a CTSZ gene. Among other uses or applications, 
20 the invention permits the definition of disease pathways and the identification of diagnostically and 
therapeutically useful targets. For example, genes that are mutated or down-regulated under disease 
conditions may be involved in causing or exacerbating the disease condition. Treatments directed at up- 
regulating the activity of such genes or treatments that involve alternate pathways, may ameliorate the 
disease condition. 

25 Generation of Targeting Construct 

The targeting construct of the present invention may be produced using standard methods 
known in the art. {see^ e.g,, Sambrook, et al, 1989, Molecular Cloning: A Laboratory Manual, Second 
Edition, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New York; E.N. Glover (eds.), 
1985, DNA Cloning: A Practical Approach, Volumes I and II; M.J. Gait (ed.), 1984, Oligonucleotide 

30 Synthesis; B.D. Hames & S.J. Higgins (eds.), 1985, Nucleic Acid Hybridization; B.D. Hames & S.J. 
Higgins (eds,), 1984, Transcription and Translation; R.I. Freshney (ed,), 1986, Animal Cell Culture; 
Inmiobilized Cells and Enzymes, IRL Press, 1986; B. Perbal, 1984, A Practical Guide To Molecular 
Cloning; F.M. Ausubel et aL, 1994, Current Protocols in Molecular Biology, John Wiley & Sons, Inc.). 
For example, the targeting construct may be prepared in accordance with conventional ways, where 

35 sequences may be synthesized, isolated from natural sources, manipulated, cloned, ligated, subjected to 
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in vitro mutagenesis, primer repair, or the like. At various stages, the joined sequences may be cloned, 

and analyzed by restriction analysis, sequencing, or the like. 

The targeting DNA can be constructed using techniques well known in the art. For example, 

the targeting DNA may be produced by chemical synthesis of oligonucleotides, nick-translation of a 
5 double-stranded DNA template, polymerase chain-reaction amplification of a sequence (or ligase chain 

reaction amplification), purification of prokaryotic or target cloning vectors harboring a sequence of 

interest (e.g,^ a cloned cDNA or genomic DNA, synthetic DNA or from any of the aforementioned 

combination) such as plasmids, phagemids, YACs, cosmids, bacteriophage DNA, other viral DNA or 

replication intermediates, or purified restriction fragments thereof, as well as other sources of single and 
10 double-stranded polynucleotides having a desired nucleotide sequence. Moreover, the length of 

homology may be selected using known methods in the art. For example, selection may be based on 

the sequence composition and complexity of the predetermined endogenous target DNA sequence(s). 
The targeting construct of the present invention typically comprises a first sequence 

homologous to a portion or region of the CTSZ gene and a second sequence homologous to a second 
15 portion or region of the CTSZ gene. The targeting construct may further comprise a positive selection 

marker, which is preferably positioned in between the first and the second DNA sequences that are 

homologous to a portion or region of the target DNA sequence. The positive selection marker may be 

operatively linked to a promoter and a polyadenylation signal. 

Other regulatory sequences known in the art may be incorporated into the targeting construct to 
20 disrupt or control expression of a particular gene in a specific cell type. In addition, the targeting 

construct may also include a sequence coding for a screening marker, for example, green fluorescent 

protein (GFP), or another modified fluorescent protein. 

Although the size of the homologous sequence is not critical and can range from as few as 

about 15-20 base pairs to as many as 100 kb, preferably each fragment is greater than about 1 kb in 
25 length, more preferably between about 1 and about 10 kb, and even more preferably between about 1 

and about 5 kb. One of skill in the art will recognize that although larger fragments may increase the 

number of homologous recombination events in ES cells, larger fragments will also be more difficult to 

clone. 

In a preferred embodiment of the present invention, the targeting construct is prepared directly 
30 from a plasmid genomic library using the methods described in pending U.S. Patent Application Ser. 
No.: 08/971,310, filed November 17, 1997, the disclosure of which is incorporated herein in its 
entirety. Generally, a sequence of interest is identified and isolated from a plasmid library in a single 
step using, for example, long-range PGR. Following isolation of this sequence, a second 
polynucleotide that will disrupt the target sequence can be readily inserted between two regions 
35 encoding the sequence of interest. In accordance with this aspect, the construct is generated in two 
steps by (1) amplifying (for example, using long-range PGR) sequences homologous to the target 
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sequence, and (2) inserting another polynucleotide (for example a selectable marker) into the PGR 
product so that it is flanked by the homologous sequences. Typically, the vector is a plasmid from a 
plasmid genomic library. The completed construct is also typically a circular plasmid. 

In another embodiment, the targeting construct is designed in accordance with the regulated 
5 positive selection method described in U.S. Patent Application Ser. No. 09/954,483, filed September 
17, 2001, the disclosure of which is incorporated herein in its entirety. The targeting construct is 
designed to include a PGK-neo fusion gene having two lacO sites, positioned in the PGK promoter and 
an NLS-lacI gene comprising a lac repressor fused to sequences encoding the NLS from the SV40 T 
antigen. 

10 In another embodiment, the targeting construct may contain more than one selectable maker 

gene, including a negative selectable marker, such as the herpes simplex virus tk (HSV-tk) gene. The 
negative selectable marker may be operatively linked to a promoter and a polyadenylation signal, (see, 
e.g., U.S. Patent No. 5,464,764; U.S. Patent No. 5,487,992; U.S. Patent No. 5,627,059; and U.S. Patent 
No. 5,631,153). 

15 

Generation of Cells and Confirmation of Homologous Recombination Events 

Once an appropriate targeting construct has been prepared, the targeting construct may be 
introduced into an appropriate host cell using any method known in the art. Various techniques may be 
employed in the present invention, including, for example: pronuclear microinjection; retrovirus 

20 mediated gene transfer into germ lines; gene targeting in embryonic stem cells; electroporation of 

embryos; sperm-mediated gene transfer; and calcium phosphate/DNA co-precipitates, microinjection of 
DNA into the nucleus, bacterial protoplast fusion with intact cells, transfection, polycations, e.g., 
polybrene, polyomithine, etc., or the like (see, e.g., U.S. Patent No. 4,873,191; Van der Putten, et al, 
1985, Proc. Nad. Acad. ScL, USA 82:6148-6152; Thompson, et al, 1989, Cell 56:313-321; Lo, 1983, 

25 Mol Cell. Biol. 3: 1803-1814; Lavitrano, et al, 1989, Cell 57:717-723). Various techniques for 
transforming mammalian cells are known in the art. (see, e.g., Gordon, 1989, Intl. Rev. CytoL, 
1 15:171-229; Keown et al., 1989, Methods in Enzymology; Keown et al., 1990, Methods and 
Enzymology, Vol. 185, pp. 527-537; Mansour et al, 1988, Nature, 336:348-352). 

In a preferred aspect of the present invention, the targeting construct is introduced into host 

30 cells by electroporation. In this process, electrical impulses of high field strength reversibly 

permeabilize biomembranes allowing the introduction of the construct. The pores created during 
electroporation permit the uptake of macromolecules such as DNA. (see, e.g.. Potter, H., et al, 1984, 
Proc. Natl Acad. Scl U.S.A. 81:7161-7165). 

Any cell type capable of homologous recombination may be used in the practice of the present 

35 invention. Examples of such target cells include cells derived from vertebrates including mammals such 
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as humans, bovine species, ovine species, murine species, simian species, and ether eucaryotic 
organisms such as filamentous fungi, and higher muhicellular organisms such as plants. 

Preferred cell types include embryonic stem (ES) cells, which are typically obtained from pre- 
implantation embryos cultured in vitro, (see, e.g., Evans, M. J., et al, 1981, Nature 292:154-156; 
5 Bradley, M. O., et a/„ 1984, Nature 309:255-258; Gossler et a/., 1986, Proc. Natl Acad, Sci, USA 
83:9065-9069; and Robertson, et al, 1986, Nature 322:445-448). The ES cells are cultured and 
prepared for introduction of the targeting construct using methods well known to the skilled artisan. 
{see, e.g., Robertson, E. J. ed. "Teratocarcinomas and Embryonic Stem Cells, a Practical Approach", 
IRL Press, Washington D.C., 1987; Bradley et al, 1986, Current Topics in Devel Biol. 20:357-371; by 

10 Hogan et ah, in "Manipulating the Mouse Embryo": A Laboratory Manual, Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor N.Y., 1986; Thomas et al, 1987, Cell 51:503; KoUer et al, 
1991, Proc. Natl Acad. ScL USA, 88: 10730; Dorin et al, 1992, Transgenic Res. 1: 101; and Veis et al, 
1993, Cell 75:229). The ES cells that will be inserted with the targeting construct are derived from an 
embryo or blastocyst of the same species as the developing embryo into which they are to be 

15 introduced. ES cells are typically selected for their ability to integrate into the inner cell mass and 
contribute to the germ line of an individual when introduced into the mammal in an embryo at the 
blastocyst stage of development. Thus, any ES cell line having this capability is suitable for use in the 
practice of the present invention. 

The present invention may also be used to knock out or otherwise modify or disrupt genes in 

20 other cell types, such as stem cells. By way of example, stem cells may be myeloid, lymphoid, or 

neural progenitor and precursor cells. These cells comprising a knock out, modification or disruption of 
a gene may be particularly useful in the study of CTSZ gene function in individual developmental 
pathways. Stem cells may be derived from any vertebrate species, such as mouse, rat, dog, cat, pig, 
rabbit, human, non-human primates and the like. 

25 After the targeting construct has been introduced into cells, the cells in which successful gene 

targeting has occurred are identified. Insertion of the targeting construct into the targeted gene is 
typically detected by identifying cells for expression of the marker gene. In a preferred embodiment, 
the cells transformed with the targeting construct of the present invention are subjected to treatment 
with an appropriate agent that selects against cells not expressing the selectable marker. Only those 

30 cells expressing the selectable marker gene survive and/or grow under certain conditions. For example, 
cells that express the introduced neomycin resistance gene are resistant to the compound G418, while 
cells that do not express the neo gene marker are killed by G418. If the targeting construct also 
comprises a screening marker such as GFP, homologous recombination can be identified through 
screening cell colonies under a fluorescent light. Cells that have undergone homologous recombination 

35 will have deleted the GFP gene and will not fluoresce. 
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If a regulated positive selection method is used in identifying homologous recombination 
events, the targeting construct is designed so that the expression of the selectable marker gene is 
regulated in a manner such that expression is inhibited following random integration but is permitted 
(derepressed) following homologous recombination. More particularly, the transfected cells are 
5 screened for expression of the neo gene, which requires that (1) the cell was successfully 

electroporated, and (2) lac repressor inhibition of neo transcription was relieved by homologous 
recombination. This method allows for the identification of transfected cells and homologous 
recombinants to occur in one step with the addition of a single drug. 

Alternatively, a positive-negative selection technique may be used to select homologous 

10 recombinants. This technique involves a process in which a first drug is added to the cell population, 
for example, a neomycin-like drug to select for growth of transfected cells, Le, positive selection. A 
second drug, such as FIAU is subsequently added to kill cells that express the negative selection 
marker, Le, negative selection. Cells that contain and express the negative selection marker are killed 
by a selecting agent, whereas cells that do not contain and express the negative selection marker 

15 survive. For example, cells with non-homologous insertion of the construct express HSV thymidine 
kinase and therefore are sensitive to the herpes drugs such as gancyclovir (GANG) or I^U (l-(2- 
deoxy 2-fluoro-B-D-arabinofluranosyl)-5-iodouracil). {see, e.g., Mansour et al. Nature 336:348-352: 
(1988); Gapecchi, Science 244:1288-1292, (1989); Gapecchi, Trends in Genet 5:70-76 (1989)). 

Successful recombination may be identified by analyzing the DNA of the selected cells to 

20 confirm homologous recombination. Various techniques known in the art, such as PGR and/or 
Southern analysis may be used to confirm homologous recombination events. 

Homologous recombination may also be used to disrupt genes in stem cells, and other cell 
types, which are not totipotent embryonic stem cells. By way of example, stem cells may be myeloid, 
lymphoid, or neural progenitor and precursor cells. Such transgenic cells may be particularly useful in 

25 the study of GTSZ gene function in individual developmental pathways. Stem cells may be derived 
from any vertebrate species, such as mouse, rat, dog, cat, pig, rabbit, human, non-human primates and 
the like. 

In cells that are not totipotent, it may be desirable to knock out both copies of the target gene 
using methods that are known in the art. For example, cells comprising homologous recombination at a 

30 target locus that have been selected for expression of a positive selection marker {e.g., Neo^ and 
screened for non-random integration, can be further selected for multiple copies of the selectable 
marker gene by exposure to elevated levels of the selective agent {e.g., G418). The cells are then 
analyzed for homozygosity at the target locus. Alternatively, a second construct can be generated with 
a different positive selection marker inserted between the two homologous sequences. The two 

35 constructs can be introduced into the cell either sequentially or simultaneously, followed by appropriate 
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selection for each of the positive marker genes. The final cell is screened for homologous 
recombination of both alleles of the target gene. 

Production of Transgenic Animals 

Selected cells are then injected into a blastocyst (or other stage of development suitable for the 
5 purposes of creating a viable animal, such as, for example, a morula) of an animal (e.g., a mouse) to 
form chimeras (see e.g., Bradley, A. in Teratocarcinomas and Embryonic Stem Cells: A Practical 
Approach, E. J. Robertson, ed., IRL, Oxford, pp. 1 13-152 (1987)). Altematively, selected ES cells can 
be allowed to aggregate with dissociated mouse embryo cells to form the aggregation chimera. A 
chimeric embryo can then be implanted into a suitable pseudopregnant female foster animal and the 

10 embryo brought to term. Chimeric progeny harbouring the homologously recombined DNA in their 
germ cells can be used to breed animals in which all cells of the animal contain the homologously 
recombined DNA. In one embodiment, chimeric progeny mice are used to generate a mouse with a 
heterozygous disruption in the CTSZ gene. Heterozygous transgenic mice can then be mated. It is well 
known in the art that typically V4 of the offspring of such matings will have a homozygous disruption in 

15 the CTSZ gene. 

The heterozygous and homozygous transgenic mice can then be compared to normal, wild-type 
mice to determine whether disruption of the CTSZ gene causes phenotypic changes, especially 
pathological changes. For example, heterozygous and homozygous mice may be evaluated for 
phenotypic changes by physical examination, necropsy, histology, clinical chemistry, complete blood 

20 count, body weight, organ weights, and cytological evaluation of bone marrow. Phenotypic changes 
may also comprise behavioral modifications or abnormalities. 

In one embodiment, the phenotype (or phenotypic change) associated with a disruption in the 
CTSZ gene is placed into or stored in a database. Preferably, the database includes: (i) genotypic data 
{e.g., identification of the disrupted gene) and (ii) phenotypic data {e.g., phenotype(s) resulting fi-om the 

25 gene disruption) associated with the genotypic data. The database is preferably electronic. In addition, 
the database is preferably combined with a search tool so that the database is searchable. 

Conditional Transgenic Animals 

The present invention further contemplates conditional transgenic or knockout animals, such as 
those produced using recombination methods. Bacteriophage PI Cre recombinase and flp recombinase 

30 from yeast plasmids are two non-limiting examples of site-specific DNA recombinase enzymes that 
cleave DNA at specific target sites (lox P sites for cre recombinase and frt sites for flp recombinase) 
and catalyze a ligation of this DNA to a second cleaved site. A large number of suitable alternative 
site-specific recombinases have been described, and their genes can be used in accordance with the 
method of the present invention. Such recombinases include the Int recombinase of bacteriophage X 

35 (with or without Xis) (Weisberg, R. et ah, in Lambda II, (Hendrix, R., et al., Eds.), Cold Spring Harbor 
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Press, Cold Spring Harbor, NY, pp. 211-50 (1983), herein incorporated by reference); Tpnl and the 
p-lactamase transposons (Mercier, et al.^ J, BacterioL, 172:3745-57 (1990)); the Tn3 resolvase 
(Flanagan & Fennewald J. Molec, Biol, 206:295-304 (1989); Stark, et ai, Cell, 58:779-90 (1989)); the 
yeast recombinases (Matsuzaki, etal, /. BacterioL, 172:610-18 (1990)); the B. subtilis SpoIVC 
5 recombinase (Sato, et al, J. Bacteriol 111: 1092-98 (1990)); the Flp recombinase (Schwartz & 

Sadowski, /. Molec.BioL, 205:647-658 (1989); Parsons, etal, J. Biol Chem,, 265:4527-33 (1990); 
Golic &Lindquist, Cell 59:499-509 (1989); Amin, etal, J. Molec, Biol, 214:55-72 (1990)); theHin 
recombinase (Glasgow, etal, J, Biol Chem., 264:10072-82 (1989)); immunoglobulin recombinases 
(Malynn, et al, CeU, 54:453-460 (1988)); and the Cin recombinase (Haffter <& Bickle, EMBO J., 

10 7:3991-3996 (1988); Hubner, et al, J. Molec, Biol, 205:493-500 (1989)), all herein incorporated by 
reference. Such systems are discussed by Echols (7. Biol Chem. 265: 14697-14700 (1990)); de 
Villartay {Nature, 335:170-74 (1988)); Craig, {Ann. Rev. Genet., 22:77-105 (1988)); Poyart-Salmeron, 
etal, {EMBO J. 8:2425-33 (1989)); Hunger-Bertling, etal.,{Mol Cell Biochem., 92:107-16 (1990)); 
and Cregg & Madden {Mol Gen. Genet, 219:320-23 (1989)), all herein incorporated by reference. 

15 Cre has been purified to homogeneity, and its reaction with the loxP site has been extensively 

characterized (Abremski & Hess J. Mol Biol 259:1509-14 (1984), herein incorporated by reference). 
Cre protein has a molecular weight of 35,000 and can be obtained commercially from New England 
Nuclear/Du Pont. The cre gene (which encodes the Cre protein) has been cloned and expressed 
(Abremski, et al. Cell 32: 1301-11 (1983), herein incorporated by reference). The Cre protein mediates 

20 recombination between two loxP sequences (Sternberg, et al. Cold Spring Harbor Symp, Quant. Biol 
45:297-309 (1981)), which may be present on the same or different DNA molecule. Because the 
internal spacer sequence of the loxP site is asynunetrical, two loxP sites can exhibit directionality 
relative to one another (Hoess & Abremski Proc. Natl Acad. Scl U.S.A. 81:1026-29 (1984)). Thus, 
when two sites on the same DNA molecule are in a directly repeated orientation, Cre will excise the 

25 DNA between the sites (Abremski, et al. Cell 32: 1301-1 1 (1983)). However, if the sites are inverted 
with respect to each other, the DNA between them is not excised after recombination but is simply 
inverted. Thus, a circular DNA molecule having two loxP sites in direct orientation will recombine to 
produce two smaller circles, whereas circular molecules having two loxP sites in an inverted orientation 
simply invert the DNA sequences flanked by the loxP sites. In addition, recombinase action can result 

30 in reciprocal exchange of regions distal to the target site when targets are present on separate DNA 
molecules. 

Recombinases have important application for characterizing gene function in knockout models. 
When the constructs described herein are used to disrupt CTSZ genes, a fusion transcript can be 
produced when insertion of the positive selection marker occurs downstream (3') of the translation 
35 initiation site of the CTSZ gene. The fusion transcript could result in some level of protein expression 
with unknown consequence. It has been suggested that insertion of a positive selection marker gene 
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can affect the expression of nearby genes. These effects may make it difficult to determine gene 
function after a knockout event since one could not discern whether a given phenotype is associated 
with the inactivation of a gene, or the transcription of nearby genes. Both potential problems are solved 
by exploiting recombinase activity. When the positive selection marker is flanked by recombinase sites 
5 in the same orientation, the addition of the corresponding recombinase will result in the removal of the 
positive selection marker. In this way, effects caused by the positive selection marker or expression of 
fusion transcripts are avoided. 

In one embodiment, purified recombinase enzyme is provided to the cell by direct 
microinjection. In another embodiment, recombinase is expressed from a co-transfected construct or 

10 vector in which the recombinase gene is operably linked to a functional promoter. An additional aspect 
of this embodiment is the use of tissue-specific or inducible recombinase constructs that allow the 
choice of when and where recombination occurs. One method for practicing the inducible forms of 
recombinase-mediated recombination involves the use of vectors that use inducible or tissue-specific 
promoters or other gene regulatory elements to express the desired recombinase activity. The inducible 

15 expression elements are preferably operatively positioned to allow the inducible control or activation of 
expression of the desired recombinase activity. Examples of such inducible promoters or other gene 
regulatory elements include, but are not limited to, tetracycline, metallothionine, ecdysone, and other 
steroid-responsive promoters, rapamycin responsive promoters, and the like (No, et ai, Proc, NatL 
Acad. Set USA, 93:3346-51 (1996); Furth, etai, Proc. NatL Acad. ScL USA, 91:9302-6 (1994)). 

20 Additional control elements that can be used include promoters requiring specific transcription factors 
such as viral, promoters. Vectors incorporating such promoters would only express recombinase 
activity in cells that express the necessary transcription factors. 

Models for Disease 

The cell- and animal-based systems described herein can be utilized as models for diseases. 

25 Animals of any species, including, but not limited to, mice, rats, rabbits, guinea pigs, pigs, micro-pigs, 
goats, and non-human primates, e.g.^ baboons, monkeys, and chimpanzees may be used to generate 
disease animal models. In addition, cells from humans may be used. These systems may be used in a 
variety of applications. Such assays may be utilized as part of screening strategies designed to identify 
agents, such as compounds that are capable of ameliorating disease symptoms. Thus, the animal- and 

30 cell-based models may be used to identify drugs, pharmaceuticals, therapies and interventions that may 
be effective in treating disease. 

Cell-based systems may be used to identify compounds that may act to ameliorate disease 
symptoms. For example, such cell systems may be exposed to a compound suspected of exhibiting an 
ability to ameliorate disease symptoms, at a sufficient concentration and for a time sufficient to elicit 

35 such an amelioration of disease symptoms in the exposed cells. After exposure, the cells are examined 
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to determine whether one or more of the disease cellular phenotypes has been altered to resemble a 
more normal or more wild-type, non-disease phenotype. 

In addition, animal-based disease systems, such as those described herein, may be used to 
identify compounds capable of ameliorating disease symptoms. Such animal models may be used as 
5 test substrates for the identification of drugs, pharmaceuticals, therapies, and interventions that may be 
effective in treating a disease or other phenotypic characteristic of the animal. For example, animal 
models may be exposed to a compound or agent suspected of exhibiting an ability to ameliorate disease 
symptoms, at a sufficient concentration and for a time sufficient to elicit such an amelioration of disease 
symptoms in the exposed animals. The response of the animals to the exposure may be monitored by 

10 assessing the reversal of disorders associated with the disease. Exposure may involve treating mother 
animals during gestation of the model animals described herein, thereby exposing embryos or fetuses to 
the compound or agent that may prevent or ameliorate the disease or phenotype. Neonatal, juvenile, 
and adult animals can also be exposed. 

More particularly, using the animal models of the invention, methods of identifying agents are 

15 provided, in which such agents can be identified on the basis of their ability to affect at least one 
phenotype associated with a disruption in a CTSZ gene. In one embodiment, the present invention 
provides a method of identifying agents having an effect on CTSZ expression or function. The method 
includes measuring a physiological response of the animal, for example, to the agent and comparing the 
physiological response of such animal to a control animal, wherein the physiological response of the 

20 animal comprising a disruption in a CTSZ as compared to the control animal indicates the specificity of 
the agent. A "physiological response" is any biological or physical parameter of an animal that can be 
measured. Molecular assays (e.g., gene transcription, protein production and degradation rates), 
physical parameters (e.g., exercise physiology tests, measurement of various parameters of respiration, 
measurement of heart rate or blood pressure and measurement of bleeding time), behavioral testing, and 

25 cellular assays (e.g., immunohistochemical assays of cell surface markers, or the ability of cells to 
aggregate or proliferate) can be used to assess a physiological response. 

Additionally, the transgenic animals and cells of the present invention may be utilized as 
models for diseases, disorders, or conditions associated with phenotypes relating to a disruption in a 
CTSZ gene. 

30 The present invention provides a unique animal model for testing and developing new 

treatments relating to the behavioral phenotypes. Analysis of the behavioral phenotype allows for the 
development of an animal model useful for testing, for instance, the efficacy of proposed genetic and 
pharmacological therapies for human genetic diseases, such as neurological, neuropsychological, or 
psychotic illnesses. 

35 A statistical analysis of the various behaviors measured can be carried out using any 

conventional statistical program routinely used by those skilled in the art (such as, for example. 
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"Analysis of Variance" or ANOVA). A "p" value of about 0.05 or less is generally considered to be 
statistically significant, although slightly higher p values may still be indicative of statistically 
significant differences. To statistically analyze abnormal behavior, a comparison is made between the 
behavior of a transgenic animal (or a group thereof) to the behavior of a wild-type mouse (or a group 
5 thereof), typically under certain prescribed conditions. "Abnormal behavior" as used herein refers to 
behavior exhibited by an animal having a disruption in the CTSZ gene, e,g. transgenic animal, which 
differs from an animal without a disruption in the CTSZ gene, e.g, wild-type mouse. Abnormal 
behavior consists of any number of standard behaviors that can be objectively measured (or observed) 
and compared. In the case of comparison, it is preferred that the change be statistically significant to 

10 confirm that there is indeed a meaningful behavioral difference between the knockout animal and the 
wild-type control animal. Examples of behaviors that may be measured or observed include, but are 
not limited to, ataxia, rapid limb movement, eye movement, breathing, motor activity, cognition, 
emotional behaviors, social behaviors, hyperactivity, hypersensitivity, anxiety, impaired learning, 
abnormal reward behavior, and abnormal social interaction, such as aggression. 

15 A series of tests may be used to measure the behavioral phenotype of the animal models of the 

present invention, including neurological and neuropsychological tests to identify abnormal behavior. 
These tests may be used to measure abnormal behavior relating to, for example, learning and memory, 
eating, pain, aggression, sexual reproduction, anxiety, depression, schizophrenia, and drug abuse, {see^ 
e.g., Crawley & Paylor, Hormones and Behavior 31:197-211 (1997)). 

20 The social interaction test involves exposing a mouse to other animals in a variety of settings. 

The social behaviors of the animals (e.g., touching, climbing, sniffing, and mating) are subsequently 
evaluated. Differences in behaviors can then be statistically analyzed and compared (see, e.g., S. E. 
File, etaL, Pharmacol. Bioch. Behav. 22:941-944 (1985); R. R. Holson, Phys. Behav. 37:239-247 
(1986)). Examplary behavioral tests include the following. 

25 The mouse startle response test typically involves exposing the animal to a sensory (typically 

auditory) stimulus and measuring the startle response of the animal (see, e.g., M. A. Geyer, et ah. Brain 
Res. Bull 25:485-498 (1990); Paylor and Crawley, Psychopharmacology 132:169-180 (1997)). A pre- 
pulse inhibition test can also be used, in which the percent inhibition (from a normal startle response) is 
measured by "cueing" the animal first with a brief low-intensity pre-pulse prior to the startle pulse. 

30 The electric shock test generally involves exposure to an electrified surface and measurement 

of subsequent behaviors such as, for example, motor activity, learning, social behaviors. The behaviors 
are measured and statistically analyzed using standard statistical tests, (see, e.g., G. J. Kant, et al, 
Pharm. Bioch. Behav. 20:793-797 (1984); N. J. Leidenheimer, et al, Pharmacol. Bioch. Behav. 30:351- 
355 (1988)). 

35 The tail-pinch or immobilization test involves applying pressure to the tail of the animal and/or 

restraining the animal's movements. Motor activity, social behavior, and cognitive behavior are 
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examples of the areas that are measured, (jsee^ e,g., M. Bertolucci D'Angic, et al, Neurochem. 55:1208- 
1214(1990)). 

The novelty test generally comprises exposure to a novel environment and/or novel objects. 
The animal's motor behavior in the novel environment and/or around the novel object are measured and 
5 statistically analyzed, {see, e,g„ D. K. Reinstein, et al, Pharm, Bioch, Behav, 17:193-202 (1982); B. 
Poucet, Behav. NeuroscL 103:1009-10016 (1989); R. R. Holson, etaL, Phys. Behav, 37:231-238 
(1986)). This test may be used to detect visual processing deficiencies or defects. 

The learned helplessness test involves exposure to stresses, for example, noxious stimuli, which 
cannot be affected by the animal's behavior. The animal's behavior can be statistically analyzed using 

10 various standard statistical tests, (see^ e.g., A. Leshner, et al., Behav. Neural Biol 26:497-501 (1979)). 

Alternatively, a tail suspension test may be used, in which the "inunobile" time of the mouse is 
measured when suspended "upside-down" by its tail. This is a measure of whether the animal 
struggles, an indicator of depression. In humans, depression is believed to result from feelings of a lack 
of control over one's life or situation. It is believed that a depressive state can be elicited in animals by 

15 repeatedly subjecting them to aversive situations over which they have no control. A condition of 
"learned helplessness" is eventually reached, in which the animal will stop trying to change its 
circumstances and simply accept its fate. Animals that stop struggling sooner are believed to be more 
prone to depression. Studies have shown that the administration of certain antidepressant drugs prior to 
testing increases the amount of time that animals struggle before giving up. 

20 The Morris water-maze test comprises learning spatial orientations in water and subsequently 

measuring the animal's behaviors, such as, for example, by counting the number of incorrect choices. 
The behaviors measured are statistically analyzed using standard statistical tests, (see, e.g., E. M. 
Spruijt, etal.. Brain Res. 527:192-197 (1990)). 

Alternatively, a Y-shaped maze may be used (see, e.g., McFarland, D.J., Pharmacology, 

25 Biochemistry and Behavior 32:723-726 (1989); Dellu, F., et al.. Neurobiology of Learning and Memory 
73:31-48 (2000)). The Y-maze is generally believed to be a test of cognitive ability. The dimensions 
of each arm of the Y-maze can be, for example, approximately 40 cm x 8 cm x 20 cm, although other 
dimensions may be used. Each arm can also have, for example, sixteen equally spaced photobeams to 
automatically detect movement within the arms. At least two different tests can be performed using 

30 such a Y-maze. In a continuous Y-maze paradigm, mice are allowed to explore all three arms of a Y- 
maze for, e.g., approximately 10 minutes. The animals are continuously tracked using photobeam 
detection grids, and the data can be used to measure spontaneous alteration and positive bias behavior. 
Spontaneous alteration refers to the natural tendency of a "normal" animal to visit the least familiar arm 
of a maze. An alternation is scored when the animal makes two consecutive tums in the same direction, 

35 thus representing a sequence of visits to the least recently entered arm of the maze. Position bias 

determines egocentrically defined responses by measuring the animal's tendency to favor turning in one 
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direction over another. Therefore, the test can detect differences in an animars ability to navigate on 
the basis of allocentric or egocentric mechanisms. The two-trial Y-maze memory test measures 
response to novelty and spatial memory based on a free-choice exploration paradigm. During the first 
trial (acquisition), the animals are allowed to freely visit two arms of the Y-maze for, e.g., 
5 approximately 15 minutes. The third arm is blocked off during this trial. The second trial (retrieval) is 
performed after an intertrial interval of, e.g., approximately 2 hours. During the retrieval trial, the 
blocked arm is opened and the animal is allowed access to all three arms for, e.g., approximately 5 
minutes. Data are collected during the retrieval trial and analyzed for the number and duration of visits 
to each arm. Because the three arms of the maze are virtually identical, discrimination between novelty 

10 and familiarity is dependent on "environmental" spatial cues around the room relative to the position of 
each arm. Changes in arm entry and duration of time spent in the novel arm in a transgenic animal 
model may be indicative of a role of that gene in mediating novelty and recognition processes. 

The passive avoidance or shuttle box test generally involves exposure to two or more 
environments, one of which is noxious, providing a choice to be learned by the animal. Behavioral 

15 measures include, for example, response latency, number of correct responses, and consistency of 
response, (see, e.g., R. Ader, etaL, Psychon. Sci. 26:125-128 (1972); R. R. Holson, Phys. Behav. 
37:221-230 (1986)). Alternatively, a zero-maze can be used. In a zero-maze, the animals can, for 
example, be placed in a closed quadrant of an elevated annular platform having, e.g., 2 open and 2 
closed quadrants, and are allowed to explore for approximately 5 minutes. This paradigm exploits an 

20 approach-avoidance conflict between normal exploratory activity and an aversion to open spaces in 
rodents. This test measures anxiety levels and can be used to evaluate the effectiveness of anti- 
anxiolytic drugs. The time spent in open quadrants versus closed quadrants may be recorded 
automatically, with, for example, the placement of photobeams at each transition site. 

The food avoidance test involves exposure to novel food and objectively measuring, for 

25 example, food intake and intake latency. The behaviors measured are statistically analyzed using 

standard statistical tests, (see, e.g., B. A. Campbell, et al, J. Comp. Physiol. Psychol. 67:15-22 (1969)). 

The elevated plus-maze test comprises exposure to a maze, without sides, on a platform, the 
animal's behavior is objectively measured by counting the number of maze entries and maze learning. 
The behavior is statistically analyzed using standard statistical tests, (see, e.g., H. A. Baldwin, et al, 

30 Brain Res. Bull, 20:603-606 (1988)). 

The stimulant-induced hyperactivity test involves injection of stimulant drugs (e.g., 
amphetamines, cocaine, PCP, and the like), and objectively measuring, for example, motor activity, 
social interactions, cognitive behavior. The animal's behaviors are statistically analyzed using standard 
statistical tests, (see, e.g., R B. S. Clarke, et al, Psychopharmacology 96:511-520 (1988); P. 

35 Kuczenski, et al, J. Neuroscience 1 1:2703-2712 (1991)). 
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The self-stimulation test generally comprises providing the mouse with the opportunity to 
regulate electrical and/or chemical stimuli to its own brain. Behavior is measured by frequency and 
pattern of self-stimulation. Such behaviors are statistically analyzed using standard statistical tests, {see, 
e.g., S. Nassif, et al. Brain Res., 332:247-257 (1985); W. L. Isaac, et al, Behav. Neurosci. 103:345-355 
5 (1989)). 

The reward test involves shaping a variety of behaviors, e.g., motor, cognitive, and social, 
measuring, for example, rapidity and reliability of behavioral change, and statistically analyzing the 
behaviors measured, {see, e.g., L. E. Jarrard, et al, Exp. Brain Res. 61:519-530 (1986)), 

The DRL (differential reinforcement to low rates of responding) performance test involves 

10 exposure to intermittent reward paradigms and measuring the number of proper responses, e.g., lever 
pressing. Such behavior is statistically analyzed using standard statistical tests, {see, e.g., J. D. Sinden, 
et al, Behav. Neurosci. 100:320-329 (1986); V. Nalwa, et al, Behav Brain Res. 17:73-76 (1985); and 
A. J. Nonneman, etal, J. Comp. Physiol. Psych. 95:588-602 (1981)). 

The spatial learning test involves exposure to a complex novel environment, measuring the 

15 rapidity and extent of spatial learning, and statistically analyzing the behaviors measured, {see, e.g., N. 
Pitsikas, et al, Pharm. Bioch. Behav. 38:931-934 (1991); B. poucet, et al. Brain Res. 37:269-280 
(1990); D. Christie, et al. Brain Res. 37:263-268 (1990); and F. Van Haaren, et al, Behav. Neurosci. 
102:481-488 (1988)). Altematively, an open-field (of) test may be used, in which the greater distance 
traveled for a given amount of time is a measure of the activity level and anxiety of the animal. When 

20 the open field is a novel environment, it is believed that an approach-avoidance situation is created, in 
which the animal is "torn" between the drive to explore and the drive to protect itself. Because the 
chamber is lighted and has no places to hide other than the comers, it is expected that a "normal" mouse 
will spend more time in the comers and around the periphery than it will in the center where there is no 
place to hide. "Normal" mice will, however, venture into the central regions as they explore more and 

25 more of the chamber. It can then be extrapolated that especially anxious mice will spend most of their 
time in the comers, with relatively little or no exploration of the central region, whereas bold {i.e., less 
anxious) mice will travel a greater distance, showing little preference for the periphery versus the 
central region. 

The visual, somatosensory and auditory neglect tests generally comprise exposure to a sensory 
30 stimulus, objectively measuring, for example, orientating responses, and statistically analyzing the 
behaviors measured, {see, e.g., J. M. Vargo, etal, Exp. Neurol. 102:199-209 (1988)). 

The consummatory behavior test generally comprises feeding and drinking, and objectively 
measuring quantity of consumption. The behavior measured is statistically analyzed using standard 
statistical tests, {see, e.g., P. J. Fletcher, et al, Psychopharmacol. 102:301-308 (1990); M. G, Corda, et 
35 al„ Proc. Nafl Acad. Sci. USA 80:2072-2076 (1983)). 
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A visual discrimination test can also be used to evaluate the visual processing of an animal. 
One or two similar objects are placed in an open field and the animal is allowed to explore for about 5- 
10 minutes. The time spent exploring each object (proximity to, movement within, e.g., about 3-5 
cm of the object is considered exploration of an object) is recorded. The animal is then removed from 
5 the open field, and the objects are replaced by a similar object and a novel object. The animal is 

returned to the open field and the percent time spent exploring the novel object over the old object is 
measured (again, over about a 5-10 minute span). "Normal" animals will typically spend a higher 
percentage of time exploring the novel object rather than the old object. If a delay is imposed between 
sampling and testing, the memory task becomes more hippocampal-dependent. If no delay is imposed, 

10 the task is more based on simple visual discrimination. This test can also be used for olfactory 

discrimination, in which the objects (preferably, simple blocks) can be sprayed or otherwise treated to 
hold an odor. This test can also be used to determine if the animal can make gustatory discriminations; 
animals that return to the previously eaten food instead of novel food exhibit gustatory neophobia. 
A hot plate analgesia test can be used to evaluate an animal's sensitivity to heat or painful 

15 stimuli. For example, a mouse can be placed on an approximately 55°C hot plate and the mouse's 

response latency (e.^., time to pick up and lick a hind paw) can be recorded. These responses are not 
reflexes, but rather "higher" responses requiring cortical involvement. This test may be used to 
evaluate a nociceptive disorder. 

A tail-flick test may also be used to evaluate an animal's sensitivity to heat or painful stimuli. 

20 For example, a high-intensity thermal stimulus can be directed to the tail of a mouse and the mouse's 
response latency recorded {e.g., the time from onset of stimulation to a rapid flick/withdrawal from the 
heat source) can be recorded. These responses are simple nociceptive reflexive responses that are 
involuntary spinally mediated flexion reflexes. This test may also be sued to evaluate a nociceptive 
disorder. 

25 An accelerating rotarod test may be used to measure coordination and balance in mice. 

Animals can be, for example, placed on a rod that acts like a rotating treadmill (or rolling log). The 
rotarod can be made to rotate slowly at first and then progressively faster until it reaches a speed of, 
e.g., approximately 60 rpm. The mice must continually reposition themselves in order to avoid falling 
off. The animals are preferably tested in at least three trials, a minimum of 20 minutes apart. Those 

30 mice that are able to stay on the rod the longest are believed to have better coordination and balance. 

A metrazol administration test can be used to screen animals for varying susceptibilities to 
seizures or similar events. For example, a 5mg/ml solution of metrazol can be infused through the tail 
vein of a mouse at a rate of, e.g., approximately 0.375 ml/min. The infusion will cause all mice to 
experience seizures, followed by death. Those mice that enter the seizure stage the soonest are believed 

35 to be more prone to seizures. Four distinct physiological stages can be recorded: soon after the start of 
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infusion, the mice will exhibit a noticeable "twitch", followed by a series of seizures, ending in a final 
tensing of the body known as "tonic extension", which is followed by death. 

CTSZ Nucleic Acid Sequence and CTSZ Gene Products 

The present invention further contemplates use of the CTSZ gene sequence to produce CTSZ 
5 gene products. CTSZ gene products may include proteins that represent functionally equivalent gene 
products. Such an equivalent gene product may contain deletions, additions or substitutions of amino 
acid residues within the amino acid sequence encoded by the gene sequences described herein, but 
which result in a silent change, thus producing a functionally equivalent CTSZ gene product. Amino 
acid substitutions may be made on the basis of similarity in polarity, charge, solubility, hydrophobicity, 

10 hydrophilicity, and/or the amphipathic nature of the residues involved. 

For example, nonpolar (hydrophobic) amino acids include alanine, leucine, isoleucine, valine, 
proline, phenylalanine, tryptophan, and methionine; polar neutral amino acids include glycine, serine, 
threonine, cysteine, tyrosine, asparagine, and glutamine; positively charged (basic) amino acids include 
arginine, lysine, and histidine; and negatively charged (acidic) amino acids include aspartic acid and 

15 glutamic acid. "Functionally equivalent", as utilized herein, refers to a protein capable of exhibiting a 
substantially similar in vivo activity as the endogenous gene products encoded by the CTSZ gene 
sequences. Alternatively, when utilized as part of an assay, "functionally equivalent" may refer to 
peptides capable of interacting with other cellular or extracellular molecules in a manner substantially 
similar to the way in which the corresponding portion of the endogenous gene product would. 

20 CTSZ nucleic acid sequences and amino acid sequences may include: (1) the sequence shown 

in Figure 1 (SEQ ID NO:l) or identified in GenBank Accession No.: AJ242663.1 GI:5019547 (2) the 
CTSZ polypeptide as shown in Figure 2 (SEQ ID NO:2) or identified in GenBank Accession No.: 
CAB44494.1 GI: 50 19548; or (3) any homologues, orthologs, variants, derivatives, active fi*agments or 
mutants of CTSZ. 

25 "Percent identity" or "% identity" refers to the percentage of sequence similarity found in a 

comparison of two or more amino acid or nucleic acid sequences. Percent identity can be determined 
electronically, e.g., by using the MegAlign.TM. program (DNASTAR, Inc., Madison Wis.). The 
MegAlign.TM. program can create alignments between two or more sequences according to different 
methods, e.g., the clustal method. (See, e.g., Higgins, D. G. and P. M. Sharp (1988) Gene 73:237-244.) 

30 The clustal algorithm groups sequences into clusters by examining the distances between all pairs. The 
clusters are aligned pairwise and then in groups. The percentage similarity between two amino acid 
sequences, e.g., sequence A and sequence B, is calculated by dividing the length of sequence A, minus 
the number of gap residues in sequence A, minus the number of gap residues in sequence B, into the 
sum of the residue matches between sequence A and sequence B, times one hundred. Gaps of low or of 

35 no similarity between the two amino acid sequences are not included in determining percentage 
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similarity. Percent identity between nucleic acid sequences can also be counted or calculated by other 
methods known in the art, e.g., the Jotun Hein method. (See, e,g,, Hein, J. (1990) Methods Enzymol. 
183:626-645.) Identity between sequences can also be determined by other methods known in the art, 
e.g., by varying hybridization conditions. 
5 Substantially purified variants, preferably, having at least 90% sequence identity to a CTSZ or 

to a fragment of CTSZ may be used in the methods of identifying agents that modulate CTSZ or 
alternatively a phenotype associated with CTSZ function as disclosed by the present invention. 

Isolated and purified polynucleotides which hybridize under stringent conditions to CTSZ or a 
fragment of CTSZ, as well as an isolated and purified CTSZ polynucleotide complementary to a CTSZ 

10 polynucleotide encoding a CTSZ amino acid sequence or a fragment thereof may be used in methods of 
identifying agents that modulate CTSZ or alternatively a phenotype associated with CTSZ function as 
disclosed by the present invention. 

"Stringent conditions" refers to conditions which permit hybridization between polynucleotides 
and CTSZ polynucleotides. Stringent conditions can be defined by salt concentration, the concentration 

15 of organic solvent, e.g., formamide, temperature, and other conditions well known in the art. In 
particular, stringency can be increased by reducing the concentration of salt, increasing the 
concentration of formamide, or raising the hybridization temperature. For example, stringent salt 
concentration will ordinarily be less than about 750 mM NaCl and 75 mM trisodium citrate, preferably 
less than about 500 mM NaCl and 50 mM trisodium citrate, and most preferably less than about 250 

20 mM NaCl and 25 mM trisodium citrate. Low stringency hybridization can be obtained in the absence of 
organic solvent, e.g., formamide, while high stringency hybridization can be obtained in the presence of 
at least about 35% formamide, and most preferably at least about 50% formamide. Stringent 
temperature conditions will ordinarily include temperatures of at least about 30.degree. C, more 
preferably of at least about 37.degree. C, and most preferably of at least about 42.degree. C. Varying 

25 additional parameters, such as hybridization time, the concentration of detergent, e.g., sodium dodecyl 
sulfate (SDS), and the inclusion or exclusion of carrier DNA, are well known to those skilled in the art. 
Various levels of stringency are accomplished by combining these various conditions as needed. In a 
preferred embodiment, hybridization will occur at 30.degree. C. in 750 mM NaCl, 75 mM trisodium 
citrate, and 1% SDS. In a more preferred embodiment, hybridization will occur at 37.degree. C. in 500 

30 mM NaCl, 50 mM trisodium citrate, 1% SDS, 35% formamide, and 100 .mu.g/ml denatured salmon 

sperm DNA (ssDNA). In a most preferred embodiment, hybridization will occur at 42.degree. C. in 250 
mM NaCl, 25 mM trisodium citrate, 1% SDS, 50% formamide, and 200 .mu.g/ml ssDNA. Useful 
variations on these conditions will be readily apparent to those skilled in the art. 

Other protein products useful according to the methods of the invention are peptides derived 

35 from or based on the CTSZ gene products produced by recombinant or synthetic means (derived 
peptides). 
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CTSZ gene products may be produced by recombinant DNA technology using techniques well 
known in the art. Thus, methods for preparing the gene polypeptides and peptides of the invention by 
expressing nucleic acids encoding gene sequences are described herein. Methods that are well known 
to those skilled in the art can be used to construct expression vectors containing gene protein coding 
5 sequences and appropriate transcriptional/translational control signals. These methods include, for 

example, in vitro recombinant DNA techniques, synthetic techniques and in vivo recombination/genetic 
recombination (see, e.g., Sambrook, et al, 1989, supra, and Ausubel, et al., 1989, supra). Alternatively, 
RNA capable of encoding gene protein sequences may be chemically synthesized using, for example, 
automated synthesizers (see, e.g. Oligonucleotide Synthesis: A Practical Approach, Gait, M. J. ed., IRL 

10 Press, Oxford (1984)). 

A variety of host-expression vector systems may be utilized to express the gene coding 
sequences of the invention. Such host-expression systems represent vehicles by which the coding 
sequences of interest may be produced and subsequently purified, but also represent cells that may, 
when transformed or transfected with the appropriate nucleotide coding sequences, exhibit the gene 

15 protein of the invention in situ. These include but are not limited to microorganisms such as bacteria 
(e.g., E. coli, B. subtilis) transformed with recombinant bacteriophage DNA, plasmid DNA or cosmid 
DNA expression vectors containing gene protein coding sequences; yeast {e.g. Saccharomyces, Pichia) 
transformed with recombinant yeast expression vectors containing the gene protein coding sequences; 
insect cell systems infected with recombinant virus expression vectors (e.g., baculovirus) containing the 

20 gene protein coding sequences; plant cell systems infected with recombinant virus expression vectors 
{e.g., cauliflower mosaic virus, CaMV; tobacco mosaic virus, TMV) or transformed with recombinant 
plasmid expression vectors {e.g., Ti plasmid) containing gene protein coding sequences; or manmialian 
cell systems {e.g. COS, CHO, BHK, 293, 3T3) harboring recombinant expression constructs containing 
promoters derived from the genome of mammalian cells {e.g., metallothionine promoter) or from 

25 mammalian viruses {e.g., the adenovirus late promoter; the vaccinia virus 7.5 K promoter). 

In bacterial systems, a number of expression vectors may be advantageously selected 
depending upon the use intended for the gene protein being expressed. For example, when a large 
quantity of such a protein is to be produced, for the generation of antibodies or to screen peptide 
libraries, for example, vectors that direct the expression of high levels of fusion protein products that 

30 are readily purified may be desirable. Such vectors include, but are not limited, to the E. coli 

expression vector pUR278 (Ruther et al, EMBO J., 2:1791-94 (1983)), in which the gene protein 
coding sequence may be ligated individually into the vector in frame with the lac Z coding region so 
that a fusion protein is produced; pIN vectors (Inouye & Inouye, Nucleic Acids Res., 13:3101-09 
(1985); Van Heeke et al, J. Biol Chem., 264:5503-9 (1989)); and the like. pGEX vectors may also be 

35 used to express foreign polypeptides as fusion proteins with glutathione S-transferase (GST). In 

general, such fusion proteins are soluble and can easily be purified from lysed cells by adsorption to 
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glutathione-agarose beads followed by elution in the presence of free glutathione. The pGEX vectors 
are designed to include thrombin or factor Xa protease cleavage sites so that the cloned CTSZ gene 
protein can be released from the GST moiety. 

Li a preferred embodiment, full length cDNA sequences are appended with in-frame Bam HI 
5 sites at the amino terminus and Eco RI sites at the carboxyl terminus using standard PCR 

methodologies (Innis, et aL (eds) PCR Protocols: A Guide to Methods and Applications, Academic 
Press, San Diego (1990)) and ligated into the pGEX-2TK vector (Pharmacia, Uppsala, Sweden). The 
resulting cDNA construct contains a kinase recognition site at the amino terminus for radioactive 
labeling and glutathione S-transferase sequences at the carboxyl terminus for affinity purification 

10 (Nilsson, et aL, EMBO 7., 4: 1075-80 (1985); Zabeau et «/., EMBO 1: 1217-24 (1982)). 

In an insect system, Autographa califomica nuclear polyhedrosis virus (AcNPV) is used as a 
vector to express foreign genes. The virus grows in Spodoptera frugiperda cells. The gene coding 
sequence may be cloned individually into non-essential regions (for example the polyhedrin gene) of 
the virus and placed under control of an AcNPV promoter (for example the polyhedrin promoter). 

15 Successful insertion of gene coding sequence will result in inactivation of the polyhedrin gene and 

production of non-occluded recombinant virus (/.e., virus lacking the proteinaceous coat coded for by 
the polyhedrin gene). These recombinant viruses are then used to infect Spodoptera frugiperda cells in 
which the inserted gene is expressed (see, e,g,. Smith, et a/., J. ViroL 46: 584-93 (1983); U.S. Patent 
No. 4,745,051). 

20 In mammalian host cells, a number of viral-based expression systems may be utilized. In cases 

where an adenovirus is used as an expression vector, the gene coding sequence of interest may be 
ligated to an adenovirus transcription/translation control complex, e.g.^ the late promoter and tripartite 
leader sequence. This chimeric gene may then be inserted in the adenovirus genome by in vitro or in 
vivo recombination. Insertion in a non-essential region of the viral genome {e,g,, region El or E3) will 

25 result in a recombinant virus that is viable and capable of expressing gene protein in infected hosts. 

(e.g., see Logan et aL, Proc. NatL Acad. ScL USA, 81:3655-59 (1984)). Specific initiation signals may 
also be required for efficient translation of inserted gene coding sequences. These signals include the 
ATG initiation codon and adjacent sequences. In cases where an entire gene, including its own 
initiation codon and adjacent sequences, is inserted into the appropriate expression vector, no additional 

30 translational control signals may be needed. However, in cases where only a portion of the gene coding 
sequence is inserted, exogenous translational control signals, including, perhaps, the ATG initiation 
codon, must be provided. Furthermore, the initiation codon must be in phase with the reading frame of 
the desired coding sequence to ensure translation of the entire insert. These exogenous translational 
control signals and initiation codons can be of a variety of origins, both natural and synthetic. The 

35 efficiency of expression may be enhanced by the inclusion of appropriate transcription enhancer 

elements, transcription terminators, etc. (see Bitter, et aL, Methods in EnzymoL, 153:516-44 (1987)). 
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In addition, a host cell strain may be chosen that modulates the expression of the inserted 
sequences, or modifies and processes the gene product in the specific fashion desired. Such 
modifications (e.g., glycosylation) and processing (e.g., cleavage) of protein products may be important 
for the function of the protein. Different host cells have characteristic and specific mechanisms for the 
5 post-translational processing and modification of proteins. Appropriate cell lines or host systems can 
be chosen to ensure the correct modification and processing of the foreign protein expressed. To this 
end, eukaryotic host cells that possess the cellular machinery for proper processing of the primary 
transcript, glycosylation, and phosphorylation of the gene product may be used. Such manmialian host 
cells include but are not limited to CHO, VERO, BHK, HeLa, COS, MDCK, 293, 3T3, WI38, etc. 

10 For long-term, high-yield production of recombinant proteins, stable expression is preferred. 

For example, cell lines that stably express the gene protein may be engineered. Rather than using 
expression vectors that contain viral origins of replication, host cells can be transformed with DNA 
controlled by appropriate expression control elements (e.g., promoter, enhancer, sequences, 
transcription terminators, polyadenylation sites, etc.), and a selectable marker. Following the 

15 introduction of the foreign DNA, engineered cells may be allowed to grow for 1-2 days in an enriched 
media, and then are switched to a selective media. The selectable marker in the recombinant plasmid 
confers resistance to the selection and allows cells that stably integrate the plasmid into their 
chromosomes and grow, to form foci, which in turn can be cloned and expanded into cell lines. This 
method may advantageously be used to engineer cell lines that express the gene protein. Such 

20 engineered cell lines may be particularly useful in screening and evaluation of compounds that affect 
the endogenous activity of the gene protein. 

In a preferred embodiment, timing and/or quantity of expression of the recombinant protein can 
be controlled using an inducible expression construct. Inducible constructs and systems for inducible 
expression of recombinant proteins will be well known to those skilled in the art. Examples of such 

25 inducible promoters or other gene regulatory elements include, but are not limited to, tetracycline, 

metallothionine, ecdysone, and other steroid-responsive promoters, rapamycin responsive promoters, 
and the like (No, etal., Proc, Natl. Acad. Sci. USA, 93:3346-51 (1996); Furth, et al, Proc. Natl Acad. 
Sci. USA, 91:9302-6 (1994)), Additional control elements that can be used include promoters requiring 
specific transcription factors such as viral, particularly HIV, promoters. In one in embodiment, a Tet 

30 inducible gene expression system is utilized. (Gossen et al, Proc. Natl Acad. ScL USA, 89:5547-51 
(1992); Gossen, et al. Science, 268:1766-69 (1995)). Tet Expression Systems are based on two 
regulatory elements derived from the tetracycline-resistance operon of the E, coli TnlO transposon — the 
tetracycline repressor protein (TetR) and the tetracycline operator sequence {tetO) to which TetR binds. 
Using such a system, expression of the recombinant protein is placed under the control of the tetO 

35 operator sequence and transfected or transformed into a host cell. In the presence of TetR, which is co- 
transfected into the host cell, expression of the recombinant protein is repressed due to binding of the 
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TetR protein to the tetO regulatory element. High-level, regulated gene expression can then be induced 
in response to varying concentrations of tetracycline (Tc) or Tc derivatives such as doxycycline (Dox), 
which compete with tetO elements for binding to TetR. Constructs and materials for tet inducible gene 
expression are available commercially from CLONTECH Laboratories, Inc., Palo Alto, CA. 
5 When used as a component in an assay system, the gene protein may be labeled, either directly 

or indirectly, to facilitate detection of a complex formed between the gene protein and a test substance. 
Any of a variety of suitable labeling systems may be used including but not limited to radioisotopes 
such as ^^I; enzyme labeling systems that generate a detectable calorimetric signal or light when 
exposed to substrate; and fluorescent labels. Where recombinant DNA technology is used to produce 
10 the gene protein for such assay systems, it may be advantageous to engineer fusion proteins that can 
facilitate labeling, inmiobilization and/or detection. 

Indirect labeling involves the use of a protein, such as a labeled antibody, which specifically 
binds to the gene product. Such antibodies include but are not limited to polyclonal, monoclonal, 
chimeric, single chain. Fab fragments and fragments produced by a Fab expression library. 

15 Production of Antibodies 

Described herein are methods for the production of antibodies capable of specifically 
recognizing one or more epitopes. Such antibodies may include, but are not limited to polyclonal 
antibodies, monoclonal antibodies (mAbs), humanized or chimeric antibodies, single chain antibodies. 
Fab fragments, F(ab*)2 fragments, fragments produced by a Fab expression library, anti-idiotypic (anti- 

20 Id) antibodies, and epitope-binding fragments of any of the above. Such antibodies may be used, for 
example, in the detection of a CTSZ gene in a biological sample, or, alternatively, as a method for the 
inhibition of abnormal CTSZ gene activity. Thus, such antibodies may be utilized as part of disease 
treatment methods, and/or may be used as part of diagnostic techniques whereby patients may be tested 
for abnormal levels of CTSZ gene proteins, or for the presence of abnormal forms of such proteins. 

25 For the production of antibodies, various host animals may be inununized by injection with the 

CTSZ gene, its expression product or a portion thereof. Such host animals may include but are not 
limited to rabbits, mice, rats, goats and chickens, to name but a few. Various adjuvants may be used to 
increase the immunological response, depending on the host species, including but not limited to 
Freund's (complete and incomplete), mineral gels such as aluminum hydroxide, surface active 

30 substances such as lysolecithin, pluronic polyols, polyanions, peptides, oil emulsions, keyhole limpet 
hemocyanin, dinitrophenol, and potentially useful human adjuvants such as BCG (bacille Calmette- 
Guerin) and Corynebacterium parvum. 

Polyclonal antibodies are heterogeneous populations of antibody molecules derived from the 
sera of animals immunized with an antigen, such as CTSZ gene product, or an antigenic functional 

35 derivative thereof. For the production of polyclonal antibodies, host animals such as those described 
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above, may be immunized by injection with gene product supplemented with adjuvants as also 
described above. 

Monoclonal antibodies, which are homogeneous populations of antibodies to a particular 
antigen, may be obtained by any technique that provides for the production of antibody molecules by 
5 continuous cell lines in culture. These include, but are not limited to the hybridoma technique of 
K5hler and Milstein, Nature, 256:495-7 (1975); and U.S. Patent No. 4,376,1 10), the human B-cell 
hybridoma technique (Kosbor, et ah. Immunology Today, 4:72 (1983); Cote, et al., Proc. Natl. Acad. 
ScL USA, 80:2026-30 (1983)), and the EBV-hybridoma technique (Cole, et al, in Monoclonal 
Antibodies And Cancer Therapy, Alan R. Liss, Inc., New York, pp. 77-96 (1985)). Such antibodies 

10 may be of any immunoglobulin class including IgG, IgM, IgE, IgA, IgD and any subclass thereof. The 
hybridoma producing the mAb of this invention may be cultivated in vitro or in vivo. Production of 
high titers of mAbs in vivo makes this the presently preferred method of production. 

In addition, techniques developed for the production of "chimeric antibodies" (Morrison, et ah, 
Proc. Natl Acad. Sci„ 81:6851-6855 (1984); Takeda, etai. Nature, 314:452-54 (1985)) by splicing the 

15 genes from a mouse antibody molecule of appropriate antigen specificity together with genes from a 
human antibody molecule of appropriate biological activity can be used. A chimeric antibody is a 
molecule in which different portions are derived from different animal species, such as those having a 
variable region derived from a murine mAb and a human immunoglobulin constant region. 

Alternatively, techniques described for the production of single chain antibodies (U.S. Patent 

20 No. 4,946,778; Bird, Science 242:423-26 (1988); Huston, et al, Proc, Natl Acad. ScL USA, 85:5879-83 
(1988); and Ward, et al. Nature, 334:544-46 (1989)) can be adapted to produce gene-single chain 
antibodies. Single chain antibodies are typically formed by linking the heavy and light chain fragments 
of the Fv region via an amino acid bridge, resulting in a single chain pol3^eptide. 

Antibody fragments that recognize specific epitopes may be generated by known techniques. 

25 For example, such fragments include but are not limited to: the F(ab*)2 fragments that can be produced 
by pepsin digestion of the antibody molecule and the Fab fragments that can be generated by reducing 
the disulfide bridges of the F(ab')2 fragments. Alternatively, Fab expression libraries may be 
constructed (Huse, et al. Science, 246:1275-81 (1989)) to allow rapid and easy identification of 
monoclonal Fab fragments with the desired specificity. 

30 Screening Methods 

Various animal-derived "preparations," including cells and tissues, as well as cell-free extracts, 
homogenates, fractions and purified proteins, may be used to determine whether a particular agent is 
capable of modulating an activity of a CTSZ or a phenotype associated therewith. For example, such 
preparations may be generated according to methods ^yell known in the art from the tissues or organs of 

35 wild-type and knockout animals. Wild-type, but not knockout, preparations will contain endogenous 
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CTSZ, as well as the native activities, interactions and effects of the CTSZ. Thus, when knockout and 
wild-type preparations are contacted with a test agent in parallel, the ability of the test agent to 
modulate CTSZ, or a phenotype associated therewith, can be determined. Agents capable of 
modulating an activity of a CTSZ or a phenotype associated therewith are identified as those that 
5 modulate wild-type, but not knockout, preparations. Modulation may be detected, for example, as the 
ability of the agent to interact with a preparation, thereby indicating interaction with the gene product 
itself or a product thereof. Alternatively, the agent may affect a structural, metabolic or biochemical 
feature of the preparation, such as enzymatic activity of the preparation related to the CTSZ. An 
inclusive discussion of the events for which modulation by a test agent may be observed is beyond the 

10 scope of this application, but will be well known by those skilled in the art. 

The present invention may be employed in a process for screening for agents such as agonists, 
Le,, agents that bind to and activate CTSZ polypeptides, or antagonists, Le., inhibit the activity or 
interaction of CTSZ polypeptides with its ligand. Thus, polypeptides of the invention may also be used 
to assess the binding of small molecule substrates and ligands in, for example, cells, cell-free 

15 preparations, chemical libraries, and natural product mixtures as known in the art. Any methods 

routinely used to identify and screen for agents that can modulate receptors may be used in accordance 
with the present invention. 

The present invention provides methods for identifying and screening for agents that modulate 
CTSZ expression or function. More particularly, cells that contain and express CTSZ gene sequences 

20 may be used to screen for therapeutic agents. Such cells may include non-recombinant monocyte cell 
lines, such as U937 (ATCC# CRL-1593), THP-1 (ATCC# TIB-202), and P388D1 (ATCC# TIB-63); 
endothelial cells such as HUVEC's and bovine aortic endothelial cells (BAEC's); as well as generic 
marmnalian cell lines such as HeLa cells and COS cells, e.g., COS-7 (ATCC# CRL-1651). Further, 
such cells may include recombinant, transgenic cell lines. For example, the transgenic mice of the 

25 invention may be used to generate cell lines, containing one or more cell types involved in a disease, 
that can be used as cell culture models for that disorder. While cells, tissues, and primary cultures 
derived from the disease transgenic animals of the invention may be utilized, the generation of 
continuous cell lines is preferred. For examples of techniques that may be used to derive a continuous 
cell line from the transgenic animals, see Small, et al, MoL Cell BioL, 5:642-48 (1985). 

30 CTSZ gene sequences may be introduced into and overexpressed in, the genome of the cell of 

interest. In order to overexpress a CTSZ gene sequence, the coding portion of the CTSZ gene sequence 
may be ligated to a regulatory sequence that is capable of driving gene expression in the cell type of 
interest. Such regulatory regions will be well known to those of skill in the art, and may be utilized in 
the absence of undue experimentation. CTSZ gene sequences may also be disrupted or underexpressed. 

35 Cells having CTSZ gene disruptions or underexpressed CTSZ gene sequences may be used, for 
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example, to screen for agents capable of affecting alternative pathways that compensate for any loss of 
function attributable to the disruption or underexpression. 

In vitro systems may be designed to identify compounds capable of binding the CTSZ gene 
products. Such compounds may include, but are not limited to, peptides made of D-and/or L- 
5 configuration amino acids (in, for example, the form of random peptide libraries; {see e.g., Lam, et al.. 
Nature^ 354:82-4 (1991)), phosphopeptides (in, for example, the form of random or partially 
degenerate, directed phosphopeptide libraries; see, e.g., Songyang, et a/.. Cell, 72:767-78 (1993)), 
antibodies, and small organic or inorganic molecules. Compounds identified may be useful, for 
example, in modulating the activity of CTSZ gene proteins, preferably mutant CTSZ gene proteins; 

10 elaborating the biological function of the CTSZ gene protein; or screening for compounds that disrupt 
normal CTSZ gene interactions or themselves disrupt such interactions. 

The principle of the assays used to identify compounds that bind to the CTSZ gene protein 
involves preparing a reaction mixture of the CTSZ gene protein and the test compound under conditions 
and for a time sufficient to allow the two components to interact and bind, thus forming a complex that 

15 can be removed and/or detected in the reaction mixture. These assays can be conducted in a variety of 
ways. For example, one method to conduct such an assay would involve anchoring the CTSZ gene 
protein or the test substance onto a solid phase and detecting target protein/test substance complexes 
anchored on the solid phase at the end of the reaction. In one embodiment of such a method, the CTSZ 
gene protein may be anchored onto a solid surface, and the test compound, which is not anchored, may 

20 be labeled, either directly or indirectly. 

In practice, microtitre plates are conveniently utilized. The anchored component may be 
immobilized by non-covalent or covalent attachments. Non-covalent attachment may be accomplished 
simply by coating the solid surface with a solution of the protein and drying. Alternatively, an 
immobilized antibody, preferably a monoclonal antibody, specific for the protein may be used to anchor 

25 the protein to the solid surface. The surfaces may be prepared in advance and stored. 

In order to conduct the assay, the nonimmobilized component is added to the coated surface 
containing the anchored component. After the reaction is complete, unreacted components are removed 
(e.g., by washing) under conditions such that any complexes formed will remain immobilized on the 
solid surface. The detection of complexes anchored on the solid surface can be accomplished in a 

30 number of ways. Where the previously nonimmobilized component is pre-labeled, the detection of 
label immobilized on the surface indicates that complexes were formed. Where the previously 
nonimmobilized component is not pre-labeled, an indirect label can be used to detect complexes 
anchored on the surface; e,g., using a labeled antibody specific for the previously nonimmobilized 
component (the antibody, in turn, may be directly labeled or indirectly labeled with a labeled anti-Ig 

35 antibody). 
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Alternatively, a reaction can be conducted in a liquid phase, the reaction products separated 
from unreacted components, and complexes detected; e.g,, using an immobilized antibody specific for 
CTSZ gene product or the test compound to anchor any complexes formed in solution, and a labeled 
antibody specific for the other component of the possible complex to detect anchored complexes. 
5 Compounds that are shown to bind to a particular CTSZ gene product through one of the 

methods described above can be further tested for their ability to elicit a biochemical response from the 
CTSZ gene protein. Agonists, antagonists and/or inhibitors of the expression product can be identified 
utilizing assays well known in the art. 

Antisense, Ribozymes, and Antibodies 
10 Other agents that may be used as therapeutics include the CTSZ gene, its expression product(s) 

and functional fragments thereof. Additionally, agents that reduce or inhibit mutant CTSZ gene activity 
may be used to ameliorate disease symptoms. Such agents include antisense, ribozyme, and triple helix 
molecules. Techniques for the production and use of such molecules are well known to those of skill in 
the art. 

15 Anti-sense RNA and DNA molecules act to directly block the translation of mRNA by 

hybridizing to targeted mRNA and preventing protein translation. With respect to antisense DNA, 
oligodeoxyribonucleotides derived from the translation initiation site, e.g.^ between the -10 and +10 
regions of the CTSZ gene nucleotide sequence of interest, are preferred. 

Ribozymes are enzymatic RNA molecules capable of catalyzing the specific cleavage of RNA. 

20 The mechanism of ribozyme action involves sequence-specific hybridization of the ribozyme molecule 
to complementary target RNA, followed by an endonucleolytic cleavage. The composition of ribozyme 
molecules must include one or more sequences complementary to the CTSZ gene mRNA, and must 
include the well known catalytic sequence responsible for mRNA cleavage. For this sequence, see U.S. 
Patent No. 5,093,246, which is incorporated by reference herein in its entirety. As such within the scope 

25 of the invention are engineered hammerhead motif ribozyme molecules that specifically and efficiently 
catalyze endonucleolytic cleavage of RNA sequences encoding CTSZ gene proteins. 

Specific ribozyme cleavage sites within any potential RNA target are initially identified by 
scanning the molecule of interest for ribozyme cleavage sites that include the following sequences, 
GUA, GUU and GUC. Once identified, short RNA sequences of between 15 and 20 ribonucleotides 

30 corresponding to the region of the CTSZ gene containing the cleavage site may be evaluated for 

predicted structural features, such as secondary structure, that may render the oligonucleotide sequence 
unsuitable. The suitability of candidate sequences may also be evaluated by testing their accessibility to 
hybridization with complementary oligonucleotides, using ribonuclease protection assays. 

Nucleic acid molecules to be used in triple helix formation for the inhibition of transcription 

35 should be single stranded and composed of deoxyribonucleotides. The base composition of these 
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oligonucleotides must be designed to promote triple helix formation via Hoogsteen base pairing rules, 
which generally require sizeable stretches of either purines or pyrimidines to be present on one strand 
of a duplex. Nucleotide sequences may be pyrimidine-based, which will result in TAT and CGC triplets 
across the three associated strands of the resulting triple helix. The pyrimidine-rich molecules provide 
5 base complementarity to a purine-rich region of a single strand of the duplex in a parallel orientation to 
that strand. In addition, nucleic acid molecules may be chosen that are purine-rich, for example, 
containing a stretch of G residues. These molecules will form a triple helix with a DNA duplex that is 
rich in GC pairs, in which the majority of the purine residues are located on a single strand of the 
targeted duplex, resulting in GGC triplets across the three strands in the triplex. 

10 Alternatively, the potential sequences that can be targeted for triple helix formation may be 

increased by creating a so called "switchback" nucleic acid molecule. Switchback molecules are 
synthesized in an alternating 5*-3', 3-5' manner, such that they base pair with first one strand of a 
duplex and then the other, eliminating the necessity for a sizeable stretch of either purines or 
pyrimidines to be present on one strand of a duplex. 

15 It is possible that the antisense, ribozyme, and/or triple helix molecules described herein may 

reduce or inhibit the transcription (triple helix) and/or translation (antisense, ribozyme) of mRNA 
produced by both normal and mutant CTSZ gene alleles. In order to ensure that substantially normal 
levels of CTSZ gene activity are maintained, nucleic acid molecules that encode and express CTSZ 
polypeptides exhibiting normal activity may be introduced into cells that do not contain sequences 

20 susceptible to whatever antisense, ribozyme, or triple helix treatments are being utilized. Alternatively, 
it may be preferable to coadminister normal CTSZ protein into the cell or tissue in order to maintain the 
requisite level of cellular or tissue CTSZ gene activity. 

Anti-sense RNA and DNA, ribozyme, and triple helix molecules of the invention may be 
prepared by any method known in the art for the synthesis of DNA and RNA molecules. These include 

25 techniques for chemically synthesizing oligodeoxyribonucleotides and oligoribonucleotides well known 
in the art such as for example solid phase phosphoramidite chemical synthesis. Alternatively, RNA 
molecules may be generated by in vitro and in vivo transcription of DNA sequences encoding the 
antisense RNA molecule. Such DNA sequences may be incorporated into a wide variety of vectors that 
incorporate suitable RNA polymerase promoters such as the T7 or SP6 polymerase promoters. 

30 Alternatively, antisense cDNA constructs that synthesize antisense RNA constitutively or inducibly, 
depending on the promoter used, can be introduced stably into cell lines. 

Various well-known modifications to the DNA molecules may be introduced as a means of 
increasing intracellular stability and half-life. Possible modifications include but are not limited to the 
addition of flanking sequences of ribonucleotides or deoxyribonucleotides to the 5' and/or 3' ends of the 

35 molecule or the use of phosphorothioate or 2' O-methyl rather than phosphodiesterase linkages within 
the oligodeoxyribonucleotide backbone. 
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Antibodies that are both specific for CTSZ protein, and in particular, the mutant CTSZ protein, 
and interfere with its activity may be used to inhibit mutant CTSZ gene function. Such antibodies may 
be generated against the proteins themselves or against peptides corresponding to portions of the 
proteins using standard techniques known in the art and as also described herein. Such antibodies 
5 include but are not limited to polyclonal, monoclonal. Fab fragments, single chain antibodies, chimeric 
antibodies, antibody mimetics, etc. 

In instances where the CTSZ protein is intracellular and whole antibodies are used, 
internalizing antibodies may be preferred. However, lipofectin liposomes may be used to deliver the 
antibody or a fragment of the Fab region that binds to the CTSZ gene epitope into cells. Where 

10 fragments of the antibody are used, the smallest inhibitory fragment that binds to the target or expanded 
target protein*s binding domain is preferred. For example, peptides having an amino acid sequence 
corresponding to the domain of the variable region of the antibody that binds to the CTSZ protein may 
be used. Such peptides may be synthesized chemically or produced via recombinant DNA technology 
using methods well known in the art (jsee^ e.g., Creighton, Proteins: Structures and Molecular Principles 

15 (1984) W.H. Freeman, New York 1983, supra-, and Sambrook, et al, 1989, supra). Alternatively, 
single chain neutralizing antibodies that bind to intracellular CTSZ gene epitopes may also be 
administered. Such single chain antibodies may be administered, for example, by expressing nucleotide 
sequences encoding single-chain antibodies within the target cell population by utilizing, for example, 
techniques such as those described in Marasco, et aL, Proc, Natl Acad. ScL USA, 90:7889-93 (1993). 

20 RNA sequences encoding CTSZ protein may be directly administered to a patient exhibiting 

disease symptoms, at a concentration sufficient to produce a level of CTSZ protein such that disease 
symptoms are ameliorated. Patients may be treated by gene replacement therapy. One or more copies 
of a normal CTSZ gene, or a portion of the gene that directs the production of a normal CTSZ protein 
with CTSZ gene function, may be inserted into cells using vectors that include, but are not limited to 

25 adenovirus, adeno-associated virus, and retrovirus vectors, in addition to other particles that introduce 
DNA into cells, such as liposomes. Additionally, techniques such as those described above may be 
utilized for the introduction of normal CTSZ gene sequences into human cells. 

Cells, preferably autologous cells, containing normal CTSZ gene expressing gene sequences 
may then be introduced or reintroduced into the patient at positions that allow for the amelioration of 

30 disease symptoms. 

Pharmaceutical Compositions. Effective Dosages, and Routes of Administration 

The identified compounds that inhibit target mutant gene expression, sjmthesis and/or activity 
can be administered to a patient at therapeutically effective doses to treat or ameliorate the disease. A 
therapeutically effective dose refers to that amount of the compound sufficient to result in amelioration 
35 of symptoms of the disease. 
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Toxicity and therapeutic efficacy of such compounds can be determined by standard 
phannaceutical procedures in cell cultures or experimental animals, e.g., for determining the LD50 (the 
dose lethal to 50% of the population) and the ED50 (the dose therapeutically effective in 50% of the 
population). The dose ratio between toxic and therapeutic effects is the therapeutic index and it can be 
5 expressed as the ratio UDsq/EDsq, Compounds that exhibit large therapeutic indices are preferred. 

While compounds that exhibit toxic side effects may be used, care should be taken to design a delivery 
system that targets such compounds to the site of affected tissue in order to minimize potential damage 
to uninfected cells and, thereby, reduce side effects. 

The data obtained from the cell culture assays and animal studies can be used in formulating a 

10 range of dosage for use in humans. The dosage of such compounds lies preferably within a range of 
circulating concentrations that include the ED50 with little or no toxicity. The dosage may vary within 
this range depending upon the dosage form employed and the route of administration utilized. For any 
compound used in the method of the invention, the therapeutically effective dose can be estimated 
initially from cell culture assays. A dose may be formulated in animal models to achieve a circulating 

15 plasma concentration range that includes the IC50 (/.e., the concentration of the test compound that 

achieves a half-maximal inhibition of symptoms) as determined in cell culture. Such information can 
be used to more accurately determine useful doses in humans. Levels in plasma may be measured, for 
example, by high performance liquid chromatography. 

Pharmaceutical compositions for use in accordance with the present invention may be 

20 formulated in conventional manner using one or more physiologically acceptable carriers or excipients. 
Thus, the compounds and their physiologically acceptable salts and solvates may be formulated for 
administration by inhalation or insufflation (either through the mouth or the nose) or oral, buccal, 
parenteral, topical, subcutaneous, intraperitoneal, intraveneous, intrapleural, intraoccular, intraarterial, 
or rectal administration. It is also contemplated that pharmaceutical compositions may be administered 

25 with other products that potentiate the activity of the compound and optionally, may include other 
therapeutic ingredients. 

For oral administration, the pharmaceutical compositions may take the form of, for example, 
tablets or capsules prepared by conventional means with pharmaceutically acceptable excipients such as 
binding agents (e.g,, pregelatinised maize starch, polyvinylpyrrolidone or hydroxypropyl 

30 methylcellulose); fillers (e.g., lactose, microcrystalline cellulose or calcium hydrogen phosphate); 

lubricants (e.g., magnesium stearate, talc or silica); disintegrants (e.g., potato starch or sodium starch 
glycolate); or wetting agents (e.g., sodium lauryl sulphate). The tablets may be coated by methods well 
known in the art. Liquid preparations for oral administration may take the form of, for example, 
solutions, syrups or suspensions, or they may be presented as a dry product for constitution with water 

35 or other suitable vehicle before use. Such liquid preparations may be prepared by conventional means 
with pharmaceutically acceptable additives such as suspending agents (e.g., sorbitol syrup, cellulose 
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derivatives or hydrogenated edible fats); emulsifying agents (e.g.y lecithin or acacia); non-aqueous 
vehicles (e.g., almond oil, oily esters, ethyl alcohol or fractionated vegetable oils); and preservatives 
(e,g., methyl or propyl-p-hydroxybenzoates or sorbic acid). The preparations may also contain buffer 
salts, flavoring, coloring and sweetening agents as appropriate. 
5 Preparations for oral administration may be suitably formulated to give controlled release of the 

active compound. 

For buccal administration the compositions may take the form of tablets or lozenges formulated 
in conventional manner. 

For administration by inhalation, the compounds for use according to the present invention are 

10 conveniently delivered in the form of an aerosol spray presentation from pressurized packs or a 

nebuliser, with the use of a suitable propellant, e.g.^ dichlorodifluoromethane, trichlorofluoromethane, 
dichlorotetrafluoroethane, carbon dioxide or other suitable gas. In the case of a pressurized aerosol the 
dosage unit may be determined by providing a valve to deliver a metered amount. Capsules and 
cartridges of e.g. gelatin for use in an inhaler or insufflator may be formulated containing a powder mix 

IS of the compound and a suitable powder base such as lactose or starch. 

The compounds may be formulated for parenteral administration by injection, e.g.^ by bolus 
injection or continuous infusion. Formulations for injection may be presented in unit dosage form, e.g., 
in ampoules or in multi-dose containers, with an added preservative. The compositions may take such 
forms as suspensions, solutions or emulsions in oily or aqueous vehicles, and may contain formulatory 

20 agents such as suspending, stabilizing and/or dispersing agents. Alternatively, the active ingredient 

may be in powder form for constitution with a suitable vehicle, e.g.^ sterile pyrogen-free water, before 
use. 

The compounds may also be formulated in rectal compositions such as suppositories or 
retention enemas, e.g., containing conventional suppository bases such as cocoa butter or other 

25 glycerides. Oral ingestion is possibly the easiest method of taking any medication. Such a route of 
administration, is generally simple and straightforward and is frequently the least inconvenient or 
unpleasant route of administration from the patient's point of view. However, this involves passing the 
material through the stomach, which is a hostile environment for many materials, including proteins 
and other biologically active compositions. As the acidic, hydrolytic and proteolytic environment of 

30 the stomach has evolved efficiently to digest proteinaceous materials into amino acids and 

oligopeptides for subsequent anabolism, it is hardly surprising that very little or any of a wide variety of 
biologically active proteinaceous material, if simply taken orally, would survive its passage through the 
stomach to be taken up by the body in the small intestine. The result, is that many proteinaceous 
medicaments must be taken in through another method, such as parenterally, often by subcutaneous, 

35 intramuscular or intravenous injection. 
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Pharmaceutical compositions may also include various buffers (e.^., Tris, acetate, phosphate), 
solubilizers (e,g.y Tween, Polysorbate), carriers such as human serum albumin, preservatives 
(thimerosol, benzyl alcohol) and anti-oxidants such as ascorbic acid in order to stabilize pharmaceutical 
activity. The stabilizing agent may be a detergent, such as tween-20, tween-80, NP-40 or Triton X-100. 
5 EBP may also be incorporated into particulate preparations of polymeric compounds for controlled 
delivery to a patient over an extended period of time. A more extensive survey of components in 
pharmaceutical compositions is found in Remington's Pharmaceutical Sciences, 18th ed., A. R. 
Gennaro, ed.. Mack Publishing, Easton, Pa. (1990). 

In addition to the formulations described previously, the compounds may also be formulated as 
10 a depot preparation. Such long acting formulations may be administered by implantation (for example, 
subcutaneously or intramuscularly) or by intramuscular injection. Thus, for example, the compounds 
may be formulated with suitable polymeric or hydrophobic materials (for example as an emulsion in an 
acceptable oil) or ion exchange resins, or as sparingly soluble derivatives, for example, as a sparingly 
soluble salt. 

15 The compositions may, if desired, be presented in a pack or dispenser device that may contain 

one or more unit dosage forms containing the active ingredient. The pack may for example comprise 
metal or plastic foil, such as a blister pack. The pack or dispenser device may be accompanied by 
instructions for administration. 

Diagnostics 

20 A variety of methods may be employed to diagnose disease conditions associated with the 

CTSZ gene. Specifically, reagents may be used, for example, for the detection of the presence of CTSZ 
gene mutations, or the detection of either over- or under- expression of CTSZ gene mRNA. 

According to the diagnostic and prognostic method of the present invention, alteration of the 
wild-type CTSZ gene locus is detected. In addition, the method can be performed by detecting the 

25 wild-type CTSZ gene locus and confirming the lack of a predisposition or neoplasia. "Alteration of a 
wild-type gene" encompasses all forms of mutations including deletions, insertions and point mutations 
in the coding and noncoding regions. Deletions may be of the entire gene or only a portion of the gene. 
Point mutations may result in stop codons, frameshift mutations or amino acid substitutions. Somatic 
mutations are those that occur only in certain tissues, e.g., in tumor tissue, and are not inherited in the 

30 germline. Germline mutations can be found in any of a body's tissues and are inherited. If only a single 
allele is somatically mutated, an early neoplastic state may be indicated. However, if both alleles are 
mutated, then a late neoplastic state may be indicated. The finding of gene mutations thus provides 
both diagnostic and prognostic information, a CTSZ gene allele that is not deleted (e.g., that found on 
the sister chromosome to a chromosome carrying a CTSZ gene deletion) can be screened for other 

35 mutations, such as insertions, small deletions, and point mutations. Mutations found in tumor tissues 
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may be linked to decreased expression of the CTSZ gene product. However, mutations leading to non- 
functional gene products may also be linked to a cancerous state. Point mutational events may occur in 
regulatory regions, such as in the promoter of the gene, leading to loss or diminution of expression of 
the mRNA. Point mutations may also abolish proper RNA processing, leading to loss of expression of 
5 the CTSZ gene product, or a decrease in mRNA stability or translation efficiency. 

One test available for detecting mutations in a candidate locus is to directly compare genomic 
target sequences from cancer patients with those from a control population. Alternatively, one could 
sequence messenger RNA after amplification, e,g,^ by PGR, thereby eliminating the necessity of 
determining the exon structure of the candidate gene. Mutations from cancer patients falling outside 

10 the coding region of the CTSZ gene can be detected by examining the non-coding regions, such as 

introns and regulatory sequences near or within the CTSZ gene. An early indication that mutations in 
noncoding regions are important may come from Northern blot experiments that reveal messenger RNA 
molecules of abnormal size or abundance in cancer patients as compared to control individuals. 

The methods described herein may be performed, for example, by utilizing pre-packaged 

15 diagnostic kits comprising at least one specific gene nucleic acid or anti-gene antibody reagent 
described herein, which may be conveniently used, e.g,^ in clinical settings, to diagnose patients 
exhibiting disease symptoms or at risk for developing disease. 

Any cell type or tissue, including brain, cortex, subcortical region, cerebellum, brainstem, 
olfactory bulb, spinal cord, eye, Harderian gland, heart, lung, liver, pancreas, kidney, spleen, thymus, 

20 lymph nodes, bone marrow, skin, gallbladder, urinary bladder, pituitary gland, adrenal gland, salivary 
gland, skeletal muscle, tongue, stomach, small intestine, large intestine, cecum, testis, epididymis, 
seminal vesicle, coagulating gland, prostate gland, ovary, uterus and white fat, in which the gene is 
expressed may be utilized in the diagnostics described below. 

DNA or RNA from the cell type or tissue to be analyzed may easily be isolated using 

25 procedures that are well known to those in the art. Diagnostic procedures may also be performed in situ 
directly upon tissue sections (fixed and/or frozen) of patient tissue obtained from biopsies or resections, 
such that no nucleic acid purification is necessary. Nucleic acid reagents may be used as probes and/or 
primers for such in situ procedures {see^ for example, Nuovo, PCR In Situ Hybridization: Protocols and 
Applications, Raven Press, N,Y. (1992)). 

30 Gene nucleotide sequences, either RNA or DNA, may, for example, be used in hybridization or 

amplification assays of biological samples to detect disease-related gene structures and expression. 
Such assays may include, but are not limited to, Southern or Northern analyses, restriction fragment 
length polymorphism assays, single stranded conformational polymorphism analyses, in situ 
hybridization assays, and polymerase chain reaction analyses. Such analyses may reveal both 

35 quantitative aspects of the expression pattern of the gene, and qualitative aspects of the gene expression 
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and/or gene composition. That is, such aspects may include, for example, point mutations, insertions, . 
deletions, chromosomal rearrangements, and/or activation or inactivation of gene expression. 

Preferred diagnostic methods for the detection of gene-specific nucleic acid molecules may 
involve for example, contacting and incubating nucleic acids, derived from the cell type or tissue being 
5 analyzed, with one or more labeled nucleic acid reagents under conditions favorable for the specific 
annealing of these reagents to their complementary sequences within the nucleic acid molecule of 
interest. Preferably, the lengths of these nucleic acid reagents are at least 9 to 30 nucleotides. After 
incubation, all non-annealed nucleic acids are removed from the nucleic acid:fingerprint molecule 
hybrid. The presence of nucleic acids from the fingerprint tissue that have hybridized, if any such 

10 molecules exist, is then detected. Using such a detection scheme, the nucleic acid from the tissue or 
cell type of interest may be immobilized, for example, to a solid support such as a membrane, or a 
plastic surface such as that on a microtitre plate or polystyrene beads. In this case, after incubation, 
non-annealed, labeled nucleic acid reagents are easily removed. Detection of the remaining, annealed, 
labeled nucleic acid reagents is accomplished using standard techniques well-known to those in the art. 

15 Alternative diagnostic methods for the detection of gene-specific nucleic acid molecules may 

involve their amplification, e.g., by PCR (the experimental embodiment set forth in MuUis U.S. Patent 
No. 4,683,202 (1987)), ligase chain reaction (Barany, Proc. Natl. Acad. Sci. USA, 88:189-93 (1991)), 
self sustained sequence replication (Guatelli, et al, Proc. Natl Acad. Sci. USA, 87: 1874-78 (1990)), 
transcriptional amplification system (Kwoh, etal, Proc. Natl Acad. ScL USA, 86:1173-77 (1989)), Q- 

20 Beta Replicase (Lizardi et al , Bio/T echnology, 6: 1 197 (1988)), or any other nucleic acid amplification 
method, followed by the detection of the amplified molecules using techniques well known to those of 
skill in the art. These detection schemes are especially useful for the detection of nucleic acid 
molecules if such molecules are present in very low numbers. 

In one embodiment of such a detection scheme, a cDNA molecule is obtained from an RNA 

25 molecule of interest {e.g., by reverse transcription of the RNA molecule into cDNA). Cell types or 

tissues from which such RNA may be isolated include any tissue in which wild-type fingerprint gene is 
known to be expressed, including, but not limited, to brain, cortex, subcortical region, cerebellum, 
brainstem, olfactory bulb, spinal cord, eye, Harderian gland, heart, lung, liver, pancreas, kidney, spleen, 
thymus, lymph nodes, bone marrow, skin, gallbladder, urinary bladder, pituitary gland, adrenal gland, 

30 salivary gland, skeletal muscle, tongue, stomach, small intestine, large intestine, cecum, testis, 

epididymis, seminal vesicle, coagulating gland, prostate gland, ovary, uterus and white fat. A sequence 
within the cDNA is then used as the template for a nucleic acid amplification reaction, such as a PCR 
amplification reaction, or the like. The nucleic acid reagents used as synthesis initiation reagents {e.g., 
primers) in the reverse transcription and nucleic acid amplification steps of this method may be chosen 

35 from among the gene nucleic acid reagents described herein. The preferred lengths of such nucleic acid 
reagents are at least 15-30 nucleotides. For detection of the amplified product, the nucleic acid 
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amplification may be performed using radioactively or non-radioactively labeled nucleotides. 
Alternatively, enough amplified product may be made such that the product may be visualized by 
standard ethidium bromide staining or by utilizing any other suitable nucleic acid staining method. 
Antibodies directed against wild-type or mutant gene peptides may also be used as disease 
5 diagnostics and prognostics. Such diagnostic methods, may be used to detect abnormalities in the level 
of gene protein expression, or abnormalities in the structure and/or tissue, cellular, or subcellular 
location of fingerprint gene protein. Structural differences may include, for example, differences in the 
size, electronegativity, or antigenicity of the mutant fingerprint gene protein relative to the normal 
fingerprint gene protein. 

10 Protein from the tissue or cell type to be analyzed may easily be detected or isolated using 

techniques that are well known to those of skill in the art, including but not limited to western blot 
analysis. For a detailed explanation of methods for carrying out western blot analysis, see Sambrook, et 
ah (1989) supra, at Chapter 18. The protein detection and isolation methods employed herein may also 
be such as those described in Harlow and Lane, for example, (Antibodies: A Laboratory Manual, Cold 

15 Spring Harbor Laboratory Press, Cold Spring Harbor, New York (1988)). 

Preferred diagnostic methods for the detection of wild-type or mutant gene peptide molecules 
may involve, for example, immunoassays wherein fingerprint gene peptides are detected by their 
interaction with an anti-fingerprint gene-specific peptide antibody. 

For example, antibodies, or fragments of antibodies useful in the present invention may be used 

20 to quantitatively or qualitatively detect the presence of wild-type or mutant gene peptides. This can be 
accomplished, for example, by inununofluorescence techniques employing a fluorescently labeled 
antibody (see below) coupled with light microscopic, flow cytometric, or fluorimetric detection. Such 
techniques are especially preferred if the fingerprint gene peptides are expressed on the cell surface. 
The antibodies (or fragments thereof) useful in the present invention may, additionally, be 

25 employed histologically, as in immunofluorescence or inununoelectron microscopy, for in situ 
detection of fingerprint gene peptides. In situ detection may be accomplished by removing a 
histological specimen from a patient, and applying thereto a labeled antibody of the present invention. 
The antibody (or fragment) is preferably applied by overlaying the labeled antibody (or fragment) onto 
a biological sample. Through the use of such a procedure, it is possible to determine not only the 

30 presence of the fingerprint gene peptides, but also their distribution in the examined tissue. Using the 
present invention, those of ordinary skill will readily perceive that any of a wide variety of histological 
methods (such as staining procedures) can be modified in order to achieve such in situ detection. 

Immunoassays for wild-type, mutant, or expanded fingerprint gene peptides typically comprise 
incubating a biological sample, such as a biological fluid, a tissue extract, freshly harvested cells, or 

35 cells that have been incubated in tissue culture, in the presence of a detectably labeled antibody capable 
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of identifying fingerprint gene peptides, and detecting the bound antibody by any of a number of 
techniques well known in the art. 

The biological sample may be brought in contact with and immobilized onto a solid phase 
support or carrier such as nitrocellulose, or other solid support that is capable of immobilizing cells, cell 
5 particles or soluble proteins. The support may then be washed with suitable buffers followed by 
treatment with the detectably labeled gene-specific antibody. The solid phase support may then be 
washed with the buffer a second time to remove unbound antibody. The amount of bound label on 
solid support may then be detected by conventional means. 

The terms "solid phase support or carrier" are intended to encompass any support capable of 

10 binding an antigen or an antibody. Well-known supports or carriers include glass, polystyrene, 
polypropylene, polyethylene, dextran, nylon, amylases, natural and modified celluloses, 
polyacrylamides, gabbros, and magnetite. The nature of the carrier can be either soluble to some extent 
or insoluble for the purposes of the present invention. The support material may have virtually any 
possible structural configuration so long as the coupled molecule is capable of binding to an antigen or 

15 antibody. Thus, the support configuration may be spherical, as in a bead, or cylindrical, as in the inside 
surface of a test tube, or the external surface of a rod. Alternatively, the surface may be flat such as a 
sheet, test strip, etc. Preferred supports include polystyrene beads. Those skilled in the art will know 
many other suitable carriers for binding antibody or antigen, or will be able to ascertain the same by use 
of routine experimentation. 

20 The binding activity of a given lot of anti-wild-type or -mutant fingerprint gene peptide 

antibody may be determined according to well known methods. Those skilled in the art will be able to 
determine operative and optimal assay conditions for each determination by employing routine 
experimentation. 

One of the ways in which the gene peptide-specific antibody can be detectably labeled is by 
25 linking the same to an enzyme and using it in an enzyme immunoassay (EIA) (VoUer, Ric Clin Lab, 
8:289-98 (1978) ["The Enzyme Linked Immunosorbent Assay (ELISA)", Diagnostic Horizons 2:1-7, 
1978, Microbiological Associates Quarterly Publication, Walkersville, Md.]; Voller, et ah, J. Clin, 
Pathol, 31:507-20 (1978); Butler, Meth, EnzymoL, 73:482-523 (1981); Maggio (ed.), Enzyme 
Inmiunoassay, CRC Press, Boca Raton, Fla. (1980); Ishikawa, et al, (eds.) Enzyme Immunoassay, 
30 Igaku-Shoin, Tokyo (1981)). The enzyme that is bound to the antibody will react with an appropriate 
substrate, preferably a chromogenic substrate, in such a manner as to produce a chemical moiety that 
can be detected, for example, by spectrophotometric, fluorimetric or by visual means. Enzymes that 
can be used to detectably label the antibody include, but are not limited to, malate dehydrogenase, 
staphylococcal nuclease, delta-5 -steroid isomerase, yeast alcohol dehydrogenase, alpha- 
35 glycerophosphate, dehydrogenase, triose phosphate isomerase, horseradish peroxidase, alkaline 

phosphatase, asparaginase, glucose oxidase, beta-galactosidase, ribonuclease, urease, catalase, glucose- 
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6-phosphate dehydrogenase, glucoamylase and acetylcholinesterase. The detection can be accomplished 
by colorimetric methods that employ a chromogenic substrate for the enzyme. Detection may also be 
accomplished by visual comparison of the extent of enzymatic reaction of a substrate in comparison 
with similarly prepared standards. 
5 Detection may also be accomplished using any of a variety of other immunoassays. For 

example, by radioactively labeling the antibodies or antibody fragments, it is possible to detect 
fingerprint gene wild-type, mutant, or expanded peptides through the use of a radioimmunoassay (RIA) 
{see^ e.g.y Weintraub, B., Principles of Radioimmunoassays, Seventh Training Course on RadioHgand 
Assay Techniques, The Endocrine Society, March, 1986). The radioactive isotope can be detected by 

10 such means as the use of a gamma counter or a scintillation counter or by autoradiography. 

It is also possible to label the antibody with a fluorescent compound. When the fluorescently 
labeled antibody is exposed to light of the proper wave length, its presence can then be detected due to 
fluorescence. Among the most commonly used fluorescent labeling compounds are fluorescein 
isothiocyanate, rhodamine, phycoerythrin, phycocyanin, allophycocyanin, o-phthaldehyde and 

15 fluorescamine. 

The antibody can also be detectably labeled using fluorescence emitting metals such as ^^^Eu, 
or others of the lanthanide series. These metals can be attached to the antibody using such metal 
chelating groups as diethylenetriaminepentacetic acid (DTPA) or ethylenediamine-tetraacetic acid 
(EDTA). 

20 The antibody also can be detectably labeled by coupling it to a chemiluminescent compound. 

The presence of the chemiluminescent-tagged antibody is then determined by detecting the presence of 
luminescence that arises during the course of a chemical reaction. Examples of particularly useful 
chemiluminescent labeling compounds are luminol, isoluminol, theromatic acridinium ester, imidazole, 
acridinium salt and oxalate ester. 

25 Likewise, a bioluminescent compound may be used to label the antibody of the present 

invention. Bioluminescence is a type of chemiluminescence found in biological systems in which a 
catalytic protein increases the efficiency of the chemiluminescent reaction. The presence of a 
bioluminescent protein is determined by detecting the presence of luminescence. Important 
bioluminescent compounds for purposes of labeling are luciferin, luciferase and aequorin. 

30 Throughout this application, various publications, patents and published patent applications are 

referred to by an identifying citation. The disclosures of these publications, patents and published 
patent specifications referenced in this application are hereby incorporated by reference into the present 
disclosure to more fully describe the state of the art to which this invention pertains. 

The following examples are intended only to illustrate the present invention and should in no 

35 way be construed as limiting the subject invention. 
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Examples 

Example 1: Generation of Mice Comprising CTSZ Disruptions 

To investigate the role of CTSZ, disruptions in CTSZ genes were produced by homologous 
recombination. Specifically, transgenic mice comprising disruptions in CTSZ genes were created. 
5 More particularly, as shown in Figure 4, a CTSZ -specific targeting construct based upon SEQ ID NO: 1 
or the sequence identified in AJ242663.1 GI:5019547, was created using as the targeting arms 
(homologous sequences) in the construct the oligonucleotide sequences identified herein as SEQ ID 
NO:3 or SEQ ID NO:4. 

The targeting construct was introduced into ES cells derived from 129/OlaHsd mouse substrain 
10 to generate chimeric mice. Fl mice were generated by breeding with C57BL/6 females. F2 

homozygous mutant mice were produced by intercrossing Fl heterozygous males and females. 
Example 2: Expression Analysis 

RT-PCR Expression. Total RNA was isolated from the organs or tissues from adult C57BL/6 
wild-type mice. RNA was DNasel treated, and reverse transcribed using random primers. The resulting 
15 cDNA was checked for the absence of genomic contamination using primers specific to non-transcribed 
genomic mouse DNA. cDNAs were balanced for concentration using HPRT primers. 

RNA transcripts were detectable in all tissues analyzed: brain, cortex, subcortical region, 
cerebellum, brainstem, olfactory bulb, spinal cord, eye, Harderian glands, heart, lung, liver, pancreas, 
kidney, spleen, thymus, lymph nodes, bone marrow, skin, gallbladder, urinary bladder, pituitary gland, 
20 adrenal gland, salivary gland, skeletal muscle, tongue, stomach, small intestine, large intestine, cecum, 
testis, epididymis, seminal vesicle, coagulating gland, prostate gland, ovaries, uterus and white fat. 

LacZ Reporter Gene Expression. In general, tissues from 7-12 week old heterozygous mutant 
mice were analyzed for lacZ expression. Organs from heterozygous mutant mice were frozen, 
sectioned (10 /im), stained and analyzed for lacZ expression using X-Gal as a substrate for beta- 
25 galactosidase, followed by a Nuclear Fast Red counterstaining. 

In addition, for brain, wholemount staining was performed. The dissected brain was cut 
longitudinally, fixed and stained using X-Gal as the substrate for beta-galactosidase. The reaction was 
stopped by washing the brain in PBS and then fixed in PBS-buffered formaldehyde. 

Wild-type control tissues were also stained for lacZ expression to reveal any background or 
30 signals due to endogenous beta-galactosidase activity. The following tissues can show staining in the 
wild-type control sections and are therefore not suitable for X-gal staining: small and large intestines, 
stomach, vas deferens and epididymis. It has been previously reported that these organs contain high 
levels of endogenous beta-galactosidase activity. 

RNA transcripts were detectable in all tissues analyzed: brain, cortex, subcortical region, 
35 cerebellum, brainstem, olfactory bulb, spinal cord, eye, Harderian glands, heart, lung, liver, pancreas, 
kidney, spleen, thymus, lymph nodes, bone marrow, skin, gallbladder, urinary bladder, pituitary gland. 
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adrenal gland, salivary gland, skeletal muscle, tongue, stomach, small intestine, large intestine, cecum, 
testis, epididymis, seminal vesicle, coagulating gland, prostate gland, ovaries, uterus and white fat. 
Example 3: Physical Examination 

A complete physical examination was performed on each mouse. Mice were first observed in 
5 their home cages for a number of general characteristics including activity level, behavior toward 

siblings, posture, grooming, breathing pattern and sounds, and movement. General body condition and 
size were noted as well identifying characteristics including coat color, belly color, and eye color. 
Following a visual inspection of the mouse in the cage, the mouse was handled for a detailed, stepwise 
examination. The head was examined first, including eyes, ears, and nose, noting any discharge, 

10 malformations, or other abnormalities. Lymph nodes and glands of the head and neck were palpated. 
Skin, hair coat, axial and appendicular skeleton, and abdomen were also examined. The limbs and 
torso were examined visually and palpated for masses, malformations or other abnormalities. The 
anogenital region was examined for discharges, staining of hair, or other changes. If the mouse 
defecates during the examination, the feces were assessed for color and consistency. Abnormal 

15 behavior, movement, or physical changes may indicate abnormalities in general health, growth, 
metabolism, motor reflexes, sensory systems, or development of the central nervous system. 
Example 4: Necropsy 

Necropsy was performed on mice following deep general anesthesia, cardiac puncture for 
terminal blood collection, and euthanasia. Body lengths and body weights were recorded for each 

20 mouse. The necropsy included detailed examination of the whole mouse, the skinned carcass, skeleton, 
and all major organ systems. Lesions in organs and tissues were noted during the examination. 
Designated organs, from which extraneous fat and connective tissue have been removed, were weighed 
on a balance, and the weights were recorded. Weights were obtained for the following organs: heart, 
liver, spleen, thymus, kidneys, and testes/epididymides. 

25 Example 5: Histopathological Analysis 

Harvested organs were fixed in about 10% neutral buffered formalin for a minimum of about 
48 hours at room temperature. Tissues were trinmied and samples taken to include the major features 
of each organ. If any abnormalities were noted at necropsy or at the time of tissue trimming, additional 
sample(s), if necessary, were taken to include the abnormalities so that it is available for microscopic 

30 analysis. Tissues were placed together, according to predetermined groupings, in tissue processing 
cassettes. All bones (and any calcified tissues) were decalcified with a formic acid or EDTA-based 
solution prior to trimming. 

The infiltration of the tissues by paraffin was performed using an automated tissue processor. 
Steps in the cycle included dehydration through a graded series of ethanols, clearing using xylene or 

35 xylene substitute and infiltration with paraffin. Tissues were embedded in paraffin blocks with a 
standard orientation of specified tissues within each block. Sections were cut from each block at a 
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thickness of about 3-5 /im and mounted onto glass slides. After drying, the slides were stained with 
hematoxylin and eosin (H&E) and a glass coverslip was mounted over the sections for examination. 
Example 6: Behavioral Analysis - Rotarod Test 

The Accelerating Rotarod was used to screen for motor coordination, balance and ataxia 
5 phenotypes. Mice were allowed to move about on their wire-cage top for 30 seconds prior to testing to 
ensure awareness. Mice were placed on the stationary rod, facing away from the experimenter. The 
"speed profile" programs the rotarod to reach 60 rpm after six minutes. A photobeam was broken when 
the animal fell, which stopped the test clock for that chamber. The animals were tested over three trials 
with a 20-minute rest period between trials, after which the mice were returned to fresh cages. The data 

10 was analyzed to determine the average speed of the rotating rod at the fall time over the three trials. A 
decrease in the speed of the rotating rod at the time of fall compared to wild-types indicated decreased 
motor coordination possibly due to a motor neuron or inner ear disorder. 
Example 7: Behavioral Analysis - Startle Test 

The startle test screens for changes in the basic fundamental nervous system or muscle-related 

15 functions. The startle reflex is a short-latency response of the skeletal musculature elicited by a sudden 
auditory stimulus. This includes changes in 1) hearing - auditory processing; 2) sensory and motor 
processing - related to the auditory circuit and culminating in a motor related output; 3) global sensory 
changes; and motor abnormalities, including skeletal muscle or motor neuron related changes. 

The startle test also screens for higher level cognitive functions. The startle reflex can be 

20 modulated by negative affective states like fear or stress. The cognitive changes include: 1) 

sensorimotor processing such as sensorimotor gating changes related to schizophrenia; 2) attention 
disorders; 3) anxiety disorders; and 4) thought disturbance disorders. 

The mice were tested in a San Diego Instruments SR-LAB sound response chamber. Each 
mouse was exposed to 9 stimulus types that were repeated in pseudo-random order ten times during the 

25 course of the entire 25-minute test. The stimulus types in decibels were: p80, p90, plOO, pllO, pl20, 
pp80, pl20, pp90, pl20, pplOO, and pl20; where p=40 msec pulse, pp=20 msec prepulse. The length 
of time between a prepulse and a pulse was 100 msec (onset to onset). The mean Vmax of the ten 
repetitions for each trial type was computed for each mouse. 
Example 8: Behavioral Analysis - Hot Plate Test 

30 The hot plate analgesia test was designed to indicate an animal's sensitivity to a painful 

stimulus. The mice were placed on a hot plate of about 55.5°C, one at a time, and latency of the mice to 
pick up and lick or fan a hindpaw was recorded. A built-in timer was started as soon as the subjects 
were placed on the hot plate surface. The timer was stopped the instant the animal lifted its paw from 
the plate, reacting to the discomfort. Animal reaction time was a measurement of the animal's 

35 resistance to pain. The time points to hindpaw licking or fanning, up to a maximum of about 60- 



wo 03/026403 



PCTAJS02/30506 



45 

seconds, was recorded. Once the behavior was observed, the animal was immediately removed from 

the hot plate to prevent discomfort or injury. 

Example 9: Behavioral Analysis - Tail Flick Test 

The tail-flick test is a test of acute nociception in which a high-intensity thermal stimulus is 
5 directed to the tail of the mouse. The time from onset of stimulation to a rapid flick/ withdrawal from 

the heat source is recorded. This test produces a simple nociceptive reflex response that is an 

involuntary spinally mediated flexion reflex. 

Example 10: Behavioral Analysis - Open Field Test 

The Open Field Test was used to examine overall locomotion and anxiety levels in mice. 
10 Increases or decreases in total distance traveled over the test time are an indication of hyperactivity or 

hypoactivity, respectively. 

The open field provides a novel environment that creates an approach-avoidance conflict 

situation in which the animal desires to explore, yet instinctively seeks to protect itself. The chamber is 

lighted in the center and has no places to hide other than the comers. A normal mouse typically spends 
15 more time in the comers and around the periphery than it does in the center. Normal mice however, will 

venture into the central regions as they explore the chamber. Anxious mice spend most of their time in 

the comers, with almost no exploration of the center, whereas bold mice travel more, and show less 

preference for the periphery versus the central regions of the chamber. 

Each mouse was placed gently in the center of its assigned chamber. Tests were conducted for 
20 10 minutes, with the experimenter out of the animals' sight. Immediately following the test session, the 

fecal boli were counted for each subject: increased boli are also an indication of anxiety. Activity of 

individual mice was recorded for the 10-minute test session and monitored by photobeam breaks in the 

X-, y- and z-axes. Measurements taken included total distance traveled, percent of session time spent in 

the central region of the test apparatus, and average velocity during the ambulatory episodes. Increases 
25 or decreases in total distance traveled over the test time indicate hyperactivity or hypoactivity, 

respectively. Alterations in the regional distribution of movement indicates anxiety phenotypes, i.e., 

increased anxiety if there is a decrease in the time spent in the central region. 

Example 11: Behavioral Analysis - Metrazol Test 

To screen for phenotypes involving changes in seizure susceptibility, the Metrazol Test was 
30 used. About 5mg/ml of Metrazol was infused through the tail vein of the mouse at a constant rate of 

about 0.375ml/min. The infusion caused all mice to experience seizures. Those mice who entered the 

seizure stage the quickest were thought to be more prone to seizures in general. 

The Metrazol test can also be used to screen for phenotypes related to epilepsy. Seven to ten 

adult wild-type and homozygote males were used. A fresh solution of about 5mg/ml pentylenetetrazole 
35 in approximately 0.9% NaCl was prepared prior to testing. Mice were weighed and loosely held in a 

restrainer. After exposure to a heat lamp to dilate the tail vein, mice were continuously infused with the 
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pentylenetetrazole solution using a syringe pump set at a constant flow rate. The following stages were 
recorded: first twitch (sometimes accompanied by a squeak), beginning of the tonic/clonic seizure, tonic 
extension and survival time. The dose required for each phase was determined and the latency to each 
phase was determined between genotypes. Alterations in any stage may indicate an overall imbalance in 
5 excitatory or inhibitory neurotransmitter levels. 

Example 12: Behavioral Analysis - Tail Suspension Test 

The tail suspension test is a single-trial test that measures a mouse's propensity towards 
depression. This method for testing antidepressants in mice was reported by Stem et ah, (1985, 
Psychopharmacology 85(3): 367-370) and is widely used as a test for a range of compounds including 

10 SSRI's, benzodiazepines, typical and atypical antipsychotics. It is believed that a depressive state can 
be elicited in laboratory animals by continuously subjecting them to aversive situations over which they 
have no control. It is reported that a condition of "learned helplessness" is eventually reached. 

Mice were suspended on a metal hanger by the tail in an acoustically and visually isolated 
setting. Total irrmiobility time during the six-minute test period was determined using a computer 

15 algorithm based upon measuring the force exerted by the mouse on the metal hanger. An increase in 
inmiobility time for mutant mice compared to wild-type mice may indicate increased "depression." 
Animals that ceased struggling sooner may be more prone to depression. Studies have shown that the 
administration of antidepressants prior to testing increases the amount of time that animals struggle 
Example 13: Hematological Analysis 

20 Blood samples were collected via a terminal cardiac puncture in a syringe. About one hundred 

microliters of each whole blood sample were transferred into tubes pre-filled with EDTA. 
Approximately 25 microliters of the blood was placed onto a glass slide to prepare a peripheral blood 
smear. The blood smears were later stained with Wright's Stain that differentially stained white blood 
cell nuclei, granules and cytoplasm, and allowed the identification of different cell types. The slides 

25 were analyzed microscopically by counting and noting each cell type in a total of 100 white blood cells. 
The percentage of each of the cell types counted was then calculated. Red blood cell morphology was 
also evaluated. 

Microscopic examinations of blood smears were performed to provide accurate differential 
blood leukocyte counts. The leukocyte differential counts were provided as the percentage composition 
30 of each cell type in the blood. 

Example 14: Densitometric Analysis 

Mice were euthanized and analyzed using a PIXImus™ densitometer. An x-ray source exposed 
the mice to a beam of both high and low energy x-rays. The ratio of attenuation of the high and low 
energies allowed the separation of bone from soft tissue, and, from within the tissue samples, lean and 
35 fat. Densitometric data including Bone Mineral Density (BMD presented as g/cm2). Bone Mineral 
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Content (BMC in g), bone and tissue area, total tissue mass, and fat as a percent of body soft tissue 
(presented as fat %) were obtained and recorded. 
Example 15: Development 

Animals were genotyped using one of two methods. The first method uses the polymerase 
5 chain reaction (PCR) with target-specific and Neo primers to amplify DNA from the targeted gene. 
The second method uses PCR and Neo primers to "count" the number of Neo genes present per 
genome. 

If homozygous mutant mice were not identified at weaning (3-4 weeks old), animals were 
assessed for lethality linked with the introduced mutation. This evaluation included embryonic, 

10 perinatal or juvenile death. 

Newborn mice were genotyped 24-48 hours after birth and monitored closely for any signs of 
stress. Dead/dying pups were recorded and grossly inspected and if possible, genotyped. In the case of 
perinatal death, late gestation embryos (-E19.5, i.e., 19.5 days post-coitum) or newborn pups were 
analyzed, genotyped and subject to further characterization. 

15 If there was no evidence of perinatal or juvenile lethality, heterozygous mutant mice were set 

up for timed pregnancies. Routinely, E10.5 embryos were analyzed for gross abnormalities and 
genotyped. Depending on these findings, earlier (routinely >E8.5) or later embryonic stages were 
characterized to identify the approximate time of death. If no homozygous mutant progeny were 
detected, blastocysts (E3,5) were isolated, genotyped directly or grown for 6 days in culture and then 

20 genotyped. Any suspected genotype-related gross abnormalities were recorded. 
Example 16: Fertility 

The reproductive traits of male and female homozygous mutant mice are tested to identify 
potential defects in spermatogenesis, oogenesis, maternal ability to support pre- or post-embryonic 
development, or manrunary gland defects and ability of the female knockout mice to nurse their pups. 

25 Homozygous mutant (-/-) mice of each gender were set up in a fertility mating with either a 

wild-type (+/+) mate or a homozygous mutant mouse of the opposite gender at about seven to about ten 
weeks of age. The numbers of pups bom from one to three litters were recorded at birth. Three weeks 
later, the live pups were counted and weaned. 

Males and females were separated after they had produced two litters or at six months (26 

30 weeks) of age, whichever comes first. 

Example 17: Inflanmiatory Bowel Disease Model 

Crohn's disease and ulcerative colitis, collectively referred to as Inflammatory bowel disease 
(IBD), affects about 1 in 1000 people in western cultures. Current treatment regimens, which include 
steroid treatment and immunosuppressives can ameliorate symptoms in patients; however, a high 

35 incidence of relapse and detrimental side effects suggest these treatments are less than ideal. Chronic 
IBD involves aspects of both the innate and adaptive immune response, in that initial destruction of 
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intestinal tissue during acute inflammation leads to a chronic T-cell mediated autoimmune disorder. An 
IBD model is used here to identify therapeutically relevant targets involved in IBD etiology and 
progression. Dextran sulfate sodium (DSS), a polymer frequently used by the pharmaceutical industry 
in animal models of IBD, is used to induce IBD in mice. 
5 Procedure: Female wild-type and homozygous mutant mice about 10-12 weeks of age were 

fed dextran sulfate sodium (DSS) orally in drinking water for 7 days (days 1-7) followed by 7 days of 
water alone (days 8-14). A second 7-day course of DSS was then given (days 15-21). The dose of DSS 
used was either no DSS (control) or 2% (weight/volume) DSS. Weight was monitored daily during the 
course of the study. Stool samples were analyzed for consistency (normal, loose or diarrhea) and were 

10 tested for hemoccult positivity or gross bleeding. The mice were analyzed daily for the presence of 

rectal bleeding. On day 22, following the second course of DSS, mice were sacrificed for necropsy and 
histological analysis of the small and large intestines. Results of the observations, necropsy and 
histological analysis were used to determine if a disruption of the target gene results in changes in 
sensitivity or resistance to disease progression in response to DSS. 

15 CTSZ gene exhibit decreased disease severity to inflammatory bowel disease. Specifically, the 

homozygous mice exhibit increased survival and decreased body weight loss as compared to wild-type 
mice. (See, for example. Figure 6 and Figure 7). 

Example 18: Antigen Induced Airway Hyperactivity Model for Asthma 

More than 17 million Americans are estimated to have asthma and the prevalence of the disease 

20 has increased 75% in the last 15 years. Acute asthmatic attacks are generally treated by 
bronchodilators, while preventative treatments included anti-inflammatory agents such as 
corticosteroids and, more recently, leukotriene modifiers. Like other allergies, asthma is the result of a 
hyperactive response by the immune system against an otherwise harmless protein (allergen; e.g. 
ragweed pollen) that includes infiltration of lung tissue by mast cells and eosinophils. Upon subsequent 

25 encounters with the allergen, an inmiediate hypersensitivity response occurs which results in impaired 
breathing and mucus formation. 

Immunization: Female wild-type and homozygous mutant mice about 8-12 weeks of age were 
used. For each mouse type (wild-type or homozygous mutant), about 8 mice will be immunized with 
ovalbumin and about 8 will be sham immunized (non-sensitized) as controls for immunizations and 

30 aerosol challenge. To induce antigen specific airway hypersensitivity, mice are immunized by 

intraperitoneal (i.p.) injection of 20 micrograms of purified ovalbumin emulsified with 2 milligrams of 
the adjuvant aluminum hydroxide in a solution of phosphate buffered saline (PBS) on days 0 and 14. 
Total volume of the immunogen/adjuvant is 0.2 milliliters. Non-sensitized mice receive injections of 
aluminum hydroxide without ovalbumin. 

35 Aerosol Challenge: On days 28, 29, and 30, mice are given an aerosol challenge of ovalbumin 

by placing mice in a plastic chamber and administering an aerosol of 1% ovalbumin (weight/volume) in 
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PBS created by ultrasonic nebulization. The aerosol is administered for 20 minutes each day. Non- 
sensitized mice receive aerosolized PBS without ovalbumin. 

Blood Collection: Blood is collected at day 0 prior to immunization, day 15 and day 31 
following plethysmography (see below). Blood (150 microliters) is collected by a tail vein nick. Blood 
5 is analyzed for the presence of IgE, and the amount of IgE present that is specific for ovalbumin. 

Measurement of Airway Hyperactivity: On day 31, airway activity is assessed by a non- 
invasive method, which measures breathing patterns in unrestrained animals. Mice are placed into a 
whole body plethysmograph and baseline readings of breathing patterns are taken for three minutes. 
Mice are exposed to nebulized PBS or methacholine (in a PBS solution) at doses ranging from three to 

10 50 milligrams per milliliter. Methacholine acts to induce bronchoconstriction in a dose dependent 

manner by activation of effector cells (predominantly eosinophils) that migrate to lung tissue following 
aerosol challenge with ovalbumin or that may be constituitively present as a result of a disruption in the 
target gene. Doses (4-5 total) are administered for three minutes each and breathing patterns are 
recorded for five minutes following each dose of inhaled methacholine or vehicle. Following the final 

1 5 recording of breathing patterns, mice are euthanized by CO2 prior to necropsy. Changes in the response 
to methacholine challenge in non-sensitized homozygous mutant mice relative to wild-type mice may 
be indicative of chronic obstructive pulmonary disease (COPD). Changes in the response to 
methacholine challenge in sensitized homozygous mutant mice relative to wild-type mice may be 
indicative of asthma. 

20 Example 19: Cutaneous Allergy Model (aka Contact Dermatitis) 

Cutaneous allergy is common in developed countries. Ten percent of U.S. children have 
allergic skin reactions such as dermatitis and cutaneous allergy represents the second most common 
occupational disease affecting about 75 in 100,000 workers. Contact allergy can be broadly categorized 
as allergic contact dermatitis (ACD), requiring multiple exposures to the allergen, and irritant contact 

25 dermatitis (ICD), which occurs following an initial exposure. Both ACD and ICD involve a local 

inflammatory response mediated by leukocytes such as neutrophils, monocytes, macrophages and mast 
cells. Unlike ICD, ACD also involves a T-cell component since the affected individual must be 
exposed to the allergen more than one time. These inflammatory reactions involve a complex interplay 
between blood borne cells, blood vessels and the affected tissue itself. 

30 Allergic Contact Dermatitis (ACD) - Procedure: On day 0, wild-type and homozygous mutant 

mice have their abdomen shaved and are primed with the allergen, a 1% (weight/volume) solution of 
fluorescein isolthiocyanate (FITC; in acetone). Priming is performed by placing 100 microliters of the 
FITC solution on the shaved abdomen and 5 microliters of the solution on each footpad. On day 6, the 
ear thickness of each mouse is measured with vernier calipers or an engineer's micrometer to establish 

35 a baseline ear thickness. To each mouse 10 microliters of a 0.5% FITC (in acetone) solution is placed 
on both sides of the right ear. The acetone solution (without FITC) is added to the left ear. Ear 
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thickness is measured again on day 7, and the mice are sacrificed for tissue harvest. Ear swelling is 
estimated by determining the change in ear thickness from baseline to 24 hours post-challenge. Ear 
swelling is indicative of an allergic response to the antigen. Histological changes, including changes in 
the cell types infiltrating the ear, may also be looked at to see changes in the type of inflammation. 
5 Other means of measuring inflammation, such as thermogenic imaging to gauge temperature changes, 
may also be used. 

Irritant Contact Dermatitis (ICD) - Procedure: On day 0, the ear thickness of each mouse 
(wild-type and homozygous mutant) is measured with vernier calipers or an engineer's micrometer. 
Ten microliters of a 2% croton oil (volume/volume) solution (in a 4:1 acetone:olive oil base) is applied 
10 to the top third of both ears of restrained animals. On day 7, the ear thickness of each mouse is 

measured, and the mice are sacrificed for tissue harvest. Ear swelling is estimated as above (see ACD 
procedure). 

Example 20: Peritonitis Model 

A major hallmark of inflammatory responses is the influx of neutrophils and monocytes to sites 

15 of tissue damage or infection. This process involves the interplay of cytokines and chemokines 
produced by non-hematopoietic cells and cognate receptors on leukocytes. Once attracted, these 
leukocytes become activated releasing still more cytokines and chemokines which induce most aspects 
of an inflammatory response. In order to more fully understand the pathways involved in the initial 
steps of inflammation, and to discover therapeutically relevant intervention points, animal models of 

20 inflammation are necessary. In a mouse model of peritonitis, mice are given an injection of 

thioglycollate broth (a yeast protein based media) to induce peritoneal inflammation. This site of 
inflammation allows easy isolation of infihrating cells for in vitro study. In this study some aspects of 
the early inflammatory process, namely neutrophil recruitment and macrophage activation, in wildtype 
and gene deficient animals are compared. 

25 Procedure: Thioglycollate (Sigma) is reconstituted to 3% weight/volume in sterile water. Mice are 
administered 1 milliliter of the thioglycollate solution or an equal volume of sterile water (for control 
mice) i.p.. At 1 or 3 days post thioglycollate challenge, mice are sacrificed. Some aspects of the early 
inflanunatory process, namely neutrophil recruitment and macrophage activation will be compared in 
homozygous mice relative to wild-type mice to examine the role of GENE NAME in the inflanunatory 

30 process. Example 21: Induction of Arthritis 

Rheumatoid arthritis is a common autoimmune disease characterized by joint swelling, 
deformation and, ultimately, destruction, culminating in severe physical disability. Rheumatoid 
arthritis is a progressive condition with well-recognized symptoms including symmetrical peripheral 
joint swelling and synovial inflammation. 

35 Transgenic mice of the present invention were tested in a model of rheumatoid arthritis to 

investigate the role of CTSG in the disease. Arthritis was induced by the intravenous (i.v.) 
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administration of a cocktail of monoclonal antibodies (mAbs) specific for arthrogenic epitopes of type 
CII collagen. These antibodies were purchased as a kit from Chemicon and are specific to peptide 
fragments derived from the CBl 1 region of type II collagen (Cat # ECM1400). Twenty-four hours after 
each injection of mAbs, mice received an intraperontoneal (i.p.) administration (SOP # 00023) of about 
5 25 iig lipopolysaccharide (LPS, Sigma), which served to boost the arthrogenic response of the mAbs. 
In the model validation study, mice received about 8 mg of mAbs as either a single administration or as 
two administrations of about 4 mg each spaced about 48 hr apart. 

Disease monitoring: Mice were monitored daily following the first injection of mAbs for signs 
of articular inflammation. Disease severity was scored based upon the number of joints involved and 

10 the severity of inflammation (i.e. presence of redness, joint swelling, # of digits involved). Mice were 
sacrificed 7 days after the first administration of mAbs. This time point was chosen based on mAb 
manufacturer's recommendation and personal communication with independent researchers (J. Mobley, 
Pfizer). At 7 days, incidence of inflammation was 100% and was prior to onset of disease resolution. 
Mice were sacrificed with CO2. 

15 Results: Homozygous mice exhibited resistance to joint inflammation induced by arthritogenic 

antibodies. Specifically homozygous mice (KO) were demonstrated decreased disease severity in this 
model of rheumatoid arthritis as to their wild-type (WT) counterparts. More particularly, the 
homozygous mice exhibited no disease progression or development of rheumatoid arthritis (See, for 
example, Figure 5). 

20 These results indicate that CTSG may provide a valuable target for the identification of 

therapeutic agents for the treatment of rheumatoid arthritis. As shown by these results, antagonism of 
CTSG may provide resistance to or treatment of rheumatoid arthritis. 
Example 22: Septic Shock 

Septicemia is the 9*^ leading cause of death in the U.S. and sepsis in newborns represents a 

25 significant mediator of infant mortality. Septicemia is mediated by the pleiotropic effects of several 
inflammatory cytokines including TNF-a, BL-la and IL-6 on multiple organ systems throughout the 
body. Septicemia is induced by the activation of monocytes and macrophages by systemic exposure to 
bacteria an bacterial cell wall products. The release of inflammatory cytokines by macrophages 
resident in blood and the peritoneal cavity is a common feature of most inflammatory responses. 

30 Therefore, delineating pathways critical to the release of these cytokines and finding targets that inhibit 
this process may have therapeutic value to a multitude of other diseases besides sepsis. In this model, 
mice will receive a challenge of a bacterial cell wall component, lipopolysaccharide (LPS), and the 
expression of inflammatory cytokines in serum will be measured in order to determine the role specific 
targets play in the initial inflammatory cascades involved in septic shock. 

35 LPS Challenge, Lipopolysaccharide (LPS) purchased from Sigma and reconstituted to 1 mg/ml 

in PBS prior to administration to mice! Mice will receive 100 ug of LPS (in 0.1 ml volume) or an equal 
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volume of PBS (control groups) i.p. using a 27 gauge needle and hand held restraint. Following 
injection the mice will be placed back into their cage. 

Separate cohorts of mice will have cardiac punctures performed following lethally dosed, 
avatine anesthesia at 1.5 and 6 hr post challenge. These time points have been chosen in order to 
5 capture the peak serum levels of specific inflammatory cytokines. Crdiac punctures will be performed 
by trained members of the necropsy staff. 

The dose of LPS is considered a lethal dose for mice of the C57B1/6 background, but death 
does not occur until 24 hr post challenge. Therefore, no mice are expected to succumb to the LPS 
during the time course of this experiment. However, LPS does induce fever in mice within 6 hours. To 
10 detect serum levels of inflammatory cytokines, including DL-la, blood is collected at 6 hr post 
challenge. 

As is apparent to one of skill in the art, various modifications of the above embodiments can be 
made without departing from the spirit and scope of this invention. These modifications and variations 
are within the scope of this invention. 
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CLAIMS 

We claim: 

1. A transgenic mouse comprising a disruption in a CTSZ gene. 

2. A transgenic mouse comprising a disruption in a CTSZ gene, wherein there is no native expression 
5 of CTSZ gene. 

3. The transgenic mouse of claim 2, wherein the disruption is heterozygous. 

4. The transgenic mouse of claim 2, wherein the disruption is homozygous. 

5. The transgenic mouse of claim 4, wherein the transgenic mouse exhibits resistance to the 
development of rheumatoid arthritis. 

10 6. The transgenic mouse of claim 4, wherein the transgenic mouse having a symptom characteristic of 
inflammatory bowel disease exhibits an increased survival rate as compared to wild-type mice. 

7. A method of producing a transgenic mouse comprising a disruption in a CTSZ gene, the method 
comprising: 

(a) providing a murine stem cell comprising a disruption in a CTSZ gene; and 
15 (b) introducing the murine stem cell into a pseudopregnant mouse, wherein the pseudopregnant 

mouse gives birth to a transgenic mouse. 

8. The transgenic mouse produced by the method of claim 7. 

9. A targeting construct comprising: 

(a) a first polynucleotide sequence homologous to at least a first portion of a CTSZ gene; 
20 (b) a second polynucleotide sequence homologous to at least a second portion of a CTSZ gene; 

and 

(c) a selectable marker. 

10. A cell comprising a disruption in a CTSZ gene, the disruption produced using the targeting 
construct of claim 9. 

25 1 1. A cell derived from the transgenic mouse of claim 1. 

12. A cell comprising a disruption in a CTSZ gene. 

13. The cell of claim 12, wherein the cell is a stem cell. 

14. The cell of claim 13, wherein the stem cell is an embryonic stem cell. 

15. The cell of claim 15, wherein the embryonic stem cell is a murine cell. 

30 16. A method of identifying an agent that modulates a symptom associated with rheumatoid arthritis, 
the method comprising: 

(a) contacting a test agent with CTSZ; and 

(b) determining whether the agent modulates CTSZ. 

17. A method of identifying an agent that modulates a symptom associated with inflammatory bowel 
35 disease, the method comprising: 

(a) contacting a test agent with a CTSZ; and 
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(b) determining whether the agent modulates the CTSZ. 
18. A method of identifying an agent for the treatment of rheumatoid arthritis, the method comprising: 

(a) contacting a test agent with CTSZ; and 

(b) determining whether the agent modulates CTSZ. 

5 19. A method of identifying an agent that modulates inflammatory bowel disease, the method 
comprising: 

(a) contacting a test agent with a CTSZ; and 

(b) determining whether the agent modulates the CTSZ molecule. 

20. A method of identifying an agent that modulates a phenotype associated with a disruption in CTSZ, 
10 the method comprising: 

(a) administering a test agent to an animal exhibiting the phenotype; and 

(b) determining whether the agent modulates the phenotype. 

21. A method of evaluating a potential therapeutic agent capable of affecting a condition associated 
with CTSZ, the method comprising: 

15 (a) administering the potential therapeutic agent to a transgenic mouse comprising a disruption 

in a CTSZ gene; and 

(b) evaluating the effects of the agent on the transgenic mouse. 

22. A method of evaluating a potential therapeutic agent capable of affecting a condition associated 
with CTSZ, the method comprising: 

20 (a) contacting the potential therapeutic agent with CTSZ; 

(b) evaluating the effects of the agent on the CTSZ. 

22. A method of identifying an agent capable of modulating a condition associated with CTSZ, the 
method comprising: 

(a) providing a first preparation derived from the mouse of claim 2; 
25 (b) providing a second preparation derived from a wild-type mouse; 

(c) contacting a test agent with the first and second preparations; and 

(d) determining whether the agent interacts with the first and second preparations, wherein 
interaction with the second preparation in the absence of interaction with the first 
preparation identifies a potential therapeutic agent. 

30 23. A therapeutic agent for treating inflanunatory bowel disease, wherein the agent modulates CTSZ. 

24. A therapeutic agent for treating rheumatoid arthritis, wherein the agent is an agonist or antagnonist 
of CTSZ. 

25. A pharmaceutical composition comprising CTSZ. 

26. A method of preparing a pharmaceutical composition for a condition associated with a function of 
35 CTSZ, the method comprising: 

(a) identifying a compound that modulates CTSZ; 
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(b) synthesizing the identified compound; and 

(c) incorporating the compound into a pharmaceutical carrier. 

27. The method of claim 26, wherein the condition is rheumatoid arthritis or inflammatory bowel 
disease. 

5 28. Phenotypic data associated with a transgenic mouse comprising a disruption in a CTSZ gene, 
wherein the phenotypic data is in an electronic database. 
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AAGTGTCCGGGCGGGGCGTCCCGAGCGCGTGGGCCTGCGGGTCGGGTCAAGAGGTAGAAGGTGCTGCGCGTGATCC 
AGGATCCGAATTGGCCCGGAGCAGGAGCATGGCGTCGTCGGGGTCGGTGCAGCAGCTGCCCCTGGTGCTGCTGATG 
TTGCTGTTGGCGAGTGCGGCACGGGCCAGACTCTACTTCCGCTCGGGCCAGACTTGCTACCATCCCATTCGCGGGG 
ACCAGCTGGCTCTGCTGGGGCGCAGGACTTATCCTCGGCCGCATGAGTACCTGTCCCCAGCGGATCTCCCCAAGAA 
TTGGGACTGGAGAAATGTGAACGGTGTCAACTATGCCAGCGTCACCAGGAACCAGCACATCCCACAGTACTGTGGT 
TCCTGCTGGGCCCACGGCAGCACCAGTGCCATGGCAGACCGAATCAACATCAAGAGGAAAGGTGCATGGCCCTCCA 
TCCTGCTGTCCGTACAGAATGTCATTGACTGTGGCAATGCTGGCTCTTGTGAAGGGGGCAATGACCTTCCGGTGTG 
GGAGTATGCCCACAAGCATGGCATCCCCGATGAGACCTGCAACAACTACCAGGCCAAGGACCAAGACTGTGACAAG 
TTTAACCAGTGTGGGACCTGCACTGAATTCAAAGAGTGTCACACCATCCAGAATTACACCCTCTGGAGAGTGGGTG 
ATTACGGCTCCCTGTCCGGGAGGGAGAAGATGATGGCCGAGATCTATGCCAATGGTCCCATCAGCTGCGGGATAAT 
GGCAACAGAGATGATGTCTAACTACACTGGGGGCATCTATGCTGAGCACCAGGACCAGGCCGTTATCAACCACATC 
ATCTCTGTAGCTGGCTGGGGTGTCAGCAACGATGGCATCGAGTACTGGATTGTCCGAAATTCATGGGGCGAACCCT 
GGGGTGAGAAAGGCTGGATGAGGATCGTGACCAGCACCTACAAGGGAGGCACAGGTGACAGCTACAACCTTGCCAT 
CGAGAGTGCCTGCACATTTGGGGACCCCATTGTTTAGGTAGATGTCTCTGGAAGCAGCGCTGTGAACCATGACAGG 
GAGGGGTGATTAATTACTGACACTGGACATGTCCAGACAGCTATAAACAGTGCTTGTGGACATGAGGACCAGAGTG 
TGGACTGCATCCCGAGAGGAGACGGTAAAGGATGAAACACAACTGCACTGGGACCCTCCGCCGTACGCTCCAGGCC 
TGCCTCCTTCAGCACTGAGCCCTCAGGCCTGCCTCCTCTTCTACAGTGCGTGCCTTCAGCCACCGGAGAAGAGAG 
(SEQ ID N0:1) 

FIGURE 1 



MAS SGS VQQLiPLVLLMLLLASAARARLYFRSGQTC YHP I RGDQLALLGRRTYPRPHE YL S PADLPKNWDWRNVNGV 
NYASWRNQHIPQYCGSCWAHGSTSAMADRINIKRKGAWPSILLSVQNVIDCGNAGSCEGGNDLPVWEYAHKHGIP 
DETCNlSnfQAKDQDCDKFNQCGTCTEFKECHTIQNYTLWRVGDYGSLSGREKiy^ 

GGIYAEHQDQAVINHIISVAGWGVSOTGIEYWIVRNSWGEPWGEKGWMRIVTSTYKGGTGDSYNLAIESACTFGDP 
IV (SEQ ID N0:2) 



FIGURE 2 



underlined = deleted in targeting construct 

[ ] = sequence flanking Neo insert in targeting construct 



AAGTGTCCGGGCGGGGCGTCCCGAGCGCGTGGGCCTGCGGGTCGGGTCAAGAGGTAGAAG 

GTGCTGCGCGTGATCCAGGATCCGAATTGGCCCGGAGCAGGAGCATGGCGTCGTCGGGGT 

CGGTGCAGCAGCTGCCCCTGGTGCTGCTGATGTTGCTGTTGGCGAGTGCGGCACGGGCCA 

GACTCTACTTCCGCTCGGGCCAGACTTGCTACCATCCCATTCGCGGGGACCAGCTGGCTC 

TGCTGGGGCGCAGGACTTATCCTCGGCCGCATGAGTACCTGTCCCCAGCGGATCTCCCCA 

AGAATTGGGACTGGAGAAATGTGAACGGTGTCAACTATGCCAGCGTCACCAGGAACCAGC 

ACATCCC ACAGTACTGTGGTTCCTGCTGGGCCC ACGGC AGC ACC AGTGCC ATGG [ C AGACC 

GAATCAACATCAAGAGGA] AAGGTGCATGGCCCTCCATCCT [ GCTGTCCGTAC AGAATGTCA 

TTGACTGTGGCAATGCTGGCTCTTGTGAAGGGGGCAATGACCTTCCGGTGTGGGAGTATG 

CCCACAAGCATGGCATCCCCGATGAGACCTGCAACAACTACCAGGCCAAGGACCAAG] ACT 

GTGACAAGTTTAACCAGTGTGGGACCTGCACTGAATTCAAAGAGTGTCACACCATCCAGA 

ATTACACCCTCTGGAGAGTGGGTGATTACGGCTCCCTGTCCGGGAGGGAGAAGATGATGG 

CCGAGATCTATGCCAATGGTCCCATCAGCTGCGGGATAATGGCAACAGAGATGATGTCTA 

ACTACACTGGGGGCATCTATGCTGAGCACCAGGACCAGGCCGTTATCAACCACATCATCT 

CTGTAGCTGGCTGGGGTGTCAGCAACGATGGCATCGAGTACTGGATTGTCCGAAATTCAT 

GGGGCGAACCCTGGGGTGAGAAAGGCTGGATGAGGATCGTGACCAGCACCTACAAGGGAG 

GCACAGGTGACAGCTACAACCTTGCCATCGAGAGTGCCTGCACATTTGGGGACCCCATTG 

TTTAGGTAGATGTCTCTGGAAGCAGCGCTGTGAACCATGACAGGGAGGGGTGATTAATTA 

CTGACACTGGACATGTCCAGACAGCTATAAACAGTGCTTGTGGACATGAGGACCAGAGTG 

TGGACTGCATCCCGAGAGGAGACGGTAAAGGATGAAACACAACTGCACTGGGACCCTCCG 

CCGTACGCTCCAGGCCTGCCTCCTTCAGCACTGAGCCCTCAGGCCTGCCTCCTCTTCTAC 

AGTGCGTGCCTTCAGCCACCGGAGAAGAGAG 
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Gene Sequence 
Structure * 

Size of full-length 
cDNA: 1291 bp 



439 bp 



Sequence Deleted 



460 bp 




— It t^^^-^ ^ 



Targeting Vector* 
(genomic sequence) 

Construct Number: 6699 



Arm Length: 
5':6.5kb 
3': 1 kb 



. . Tai^ting Vettor; 
* Not drawn to scale 



LacZ-Neo 




5 • >GGACCTCTAAACCCGGCCCTG 

CCTCATGTCACTTGCAGACAGACA 

GGCAGATGTCATGGCGCTGAGGAA 

GACCAGAGAAAGTCTCTGAAGGAC 

CGTGGGGCAGAGCCCTGGGACGTT 

TGCTGGGGCAGTCTTGGTGTTGGG 

GGGAACGTGGGTGCTGACCAGAGT 

GCTGCCTCTTGCAGACCGAATCAA 

CATCAAGAGGAO • (SEQ ID 

NO: 3) 



5 • >GCTGTCCGTACAGAATGTCAT 

TGACTGTGGCAATGCTGGCTCTTG 

TGAAGGGGGCAATGACCTTCCGGT 

GTGGGAGTATGCCCACAAGCATGG 

CATCCCCGATGAGACCTGCAACAA 

CTACCAGGCCAAGGACCAAGGTAG 

ACCCTGCCTATGTTCCCACATTGC 

TACACACCTGTCTCGTTCCAAGGG 

AGAGTGGCTGA<3« (SEQ ID 

N0:4) 



FIGURE 4 
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